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{ * | *F | * } 


This provides three choices: *, *+, or *-. 


Unless the list is enclosed in square brackets, you must choose one item 
from the list. . 


C4 Some directives can have a varying number of parameters. For example, 
the .byte directive can have up to 100 parameters. The syntax for this di- 
rective is: 


-byte value, [, ..., Valuen] 


This syntax shows that .byte must have at least one value parameter, but 
you have the option of supplying additional value parameters, separated 
by commas. 


Information About Cautions and Warnings 


viii 


This book may contain cautions and warnings. 


Cy Acaution describes a situation that could potentially damage your soft- 
ware or equipment. 


Ci ~Awarning describes a situation that could potentially cause harm to you. 


The information in a caution or a warning is provided for your protection. Please 
read each caution and warning carefully. 
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1.2 General Description 


The TMS320C5x generation consists of the TMS320C50 and TMS320C51 de- 
vices. These digital signal processors are fabricated in accordance with static 
CMOS integrated-circuit technology. Their architectural design is based upon 
that of the TMS320C25. The combination of an advanced Harvard architecture 
(separate buses for program memory and data memory), additional on-chip 
peripherals, more on-chip memory, and a highly specialized instruction set is 
the basis of the operational flexibility and speed of these DSP devices. 
TMS320C5x devices are designed to execute more than 28 MIPS (million in- 
structions per second). Spin-off devices with the core CPU and customized 
on-chip memory and peripheral configurations can be developed for special- 
ized areas of the electronics market. 


The TMS320C5x generation offers these advantages: 


Cy enhanced TMS320 architectural design for increased performance and 
versatility 

a modular architectural design for rapidly developing spin-off devices 
advanced IC processing technology for increased performance 
source-code compatibility with TMS320C1x and TMS320C2x DSPs for 
maintaining a roadmap between fixed-point processors and for protecting 
the TMS320 design investments 

enhanced TMS320 instruction set for faster algorithms and for optimized 
high-level language operation 

new static design techniques for minimizing power consumption and maxi- 
mizing radiation hardness 


Ooo 


Table 1—1 provides an overview of the TMS320C5x generation of digital signal 
processors. It shows the capacity of on-chip RAM and ROM memories, num- 
ber of serial and parallel I/O ports, execution time of one machine cycle, and 
type of package with total pin count. The chart should help you choose the best 
processor for an application. 


The following subsections summarize features of the TMS320C5x processors. 
The description of the CPU applies to all TMS320C5x-generation members 
(current and future). At this time, however, descriptions of the remaining fea- 
tures pertain only to the TMS320C50 and/or the TMS320C51. Detailed infor- 
mation on their CPU, memory, and on-chip peripherals is given in Chapters 3, 
6, and 5, respectively. 
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1.3 Key Features 


At this time, the TMS320C5x generation consists of the TMS320C50 and the 
TMS320C51 digital signal processors. Key features of these DSPs are listed 
below. Where a feature is exclusive to a particular member, the member’s 
name is enclosed within a set of parentheses and noted after that feature. 


{ 35-50-ns single-cycle fixed-point instruction execution time 
(28.6 — 20 MIPS) 


Upward source-code compatible with all TMS320C1x and TMS320C2x 
devices. 


RAM-based memory operation (TMS320C50) 
ROM-based memory operation (TMS320C51) 


oO 


9K x 16-bit single-cycle on-chip program/data RAM (TMS320C50) 
1K x 16-bit single-cycle Aitchia piogranidata RAM (TMS320C51) 
2K x 16-bit single-cycle on-chip boot ROM (TMS320C50) 

8K x 16-bit single-cycle on-chip program ROM (TMS320C51) 
1056 x 16-bit dual-access on-chip data RAM 


Oodooda dod 


224K x 16-bit maximum addressable external memory space (64K pro- 
gram, 64K data, 64K I/O, and 32K global) 


o 


32-bit arithmetic logic unit (ALU), 32-bit accumulator (ACC), and 32-bit ac- 
cumulator buffer (ACCB) 


16-bit parallel logic unit (PLU) 
16 x 16-bit parallel multiplier with a 32-bit product capability 


Single-cycle multiply/accumulate instructions 


Of oa 


Eight auxiliary registers with a dedicated arithmetic unit for indirect ad- 
dressing 


co! 


Eleven context-switch registers (shadow registers) for storing strategic 
CPU-controlled registers during an interrupt service routine 


‘al 


Eight-level hardware stack 


Ch 0-to 16-bit left and right data barrel-shifters and a 64-bit incremental data 
shifter 


Ck =Two indirectly addressed circular buffers for circular addressing 
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memory-mapped core-CPU registers and 16 memory-mapped I/O ports. See 
Chapter 3 for more details. 


1.3.2 On-Chip ROM 


The TMS320C50 features a 2K x 16-bit on-chip, maskable, programmable 
ROM. This memory is used for booting from slower external ROM or EPROM 
of program to fast on-chip or external SRAM. ROM can be selected during re- 
set by driving the MP/MC pin low. Once your program has been booted into the 
RAM, this boot ROM can be operationally removed from the program memory 
space via the MP/MC bitin the PMST status register. If the ROM is not selected, 
the TMS320C50 starts its execution via an off-chip memory. 


The TMS320C51 features an 8K x 16-bit on-chip maskable ROM. You can use 
this memory for your specified program. Once the development of the program 
has stabilized, submit a ROM code to Texas Instruments for implementation 
into your device. See Chapter 6 for more details. 


1.3.3 On-Chip Data RAM 


Both TMS320C5x devices carry a 1056 x 16-bit on-chip data RAM. This RAM 
can be accessed twice per machine cycle (dual-access RAM) as long as both 
accesses are not write operations. This block of memory is primarily intended 
to store data values but, when needed, can be used to store programs as well 
as data. It can be configured in one of two ways: either all 1056 x 16 bits as 
data memory or 544 x 16 bits as data memory with 512 x 16 bits as program 
memory. You can select the configuration with the CNF bit in status register 
ST1. See Chapter 6 for more details. 


1.3.4 On-Chip Program/Data RAM 


The TMS320C50 has a 9K x 16-bit on-chip RAM. The TMS320C51 has a 1K 
x 16-bit on-chip RAM. This memory is software configurable as program and/or 
data memory space. Code can be booted from an off-chip nonvolatile memory 
and then executed at full speed, once it is loaded into this RAM. See Chapter 
6 for more details. 


1.3.5 On-Chip Memory Security 


The TMS320C5x generation has a maskable option to protect the contents of 
on-chip memories. When the related bit is set, no externally originating instruc- 
tion can access the on-chip memory spaces. See Chapter 6 for more details. 


1.3.6 Address-Mapped Software Wait-State Generators 


Software wait-state logic is incorporated without any external hardware into 
TMS320C65x for interfacing with slower off-chip memory and I/O devices. This 
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ing devices. Also, it can be used to test pin-to-pin continuity as well as to per- 
form operational tests on those peripheral devices that surround the 
TMS320C5x. It is interfaced to another internal scanning logic circuitry, which 
has access to all of the on-chip resources. Thus, the TMS320C5x can perform 
on-board emulation by means of the JTAG serial scan pins and the emulation- 
dedicated pins. See IEEE Standard P1149.1 for more details. 


1.3.12 TMS320C5x Package 


TMS320C5x devices are packaged in a 132-pin Quad Flat Pack package 
(QFP). With consideration for the pin layout of a TMS320C25 package, the 
TMS320C5x package is designed to minimize printed circuit board modifica- 
tions when a TMS320Cex processing system is upgraded to a TMS320C5x 
processing system. Signal call-outs for the TMS320C5x appear on the same 
side and in the same order as those for the TMS320C25. See Chapter 2 for 
details. 
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2 | Pinouts and Signal Descriptions 


Pin Layout 


2.1 Pin Layout 
Both the TMS320C50 and the TMS320C51 devices are packaged in a 132-pin 


Quad Flat Pack package (QFP) and have the same pin-to-signal relationship. 
Figure 2—1 shows the pin/signal call-outs for this package. 


Figure 2-1. Signal Assignments for TMS320C5x 132-Pin QFP 
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Note: NC = Noconnect. (These pins are reserved.) 
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Signal Descriptions 


Table 2~1. TMS320C5x Signal Descriptions (Continued) 
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Memory Control Signals 


Data, Program, and I/O space select signals. Always high 
unless low level asserted for communicating to a particular 
external space. Placed into a high-impedance state in hold 
mode. These signals also go into high-impedance whenOFF 
is active low. 


Data ready input. Indicates that an external device is pre- 
pared for the bus transaction to be completed. If the device 
is not ready (READY is low), the processor waits one cycle 
and checks READY again. READY also indicates a bus 
grant to an external device after a BR (bus request) signal. 


Read/Write signal. Indicates transfer direction during com- 
munication to an external device. Normally in read mode 
(high), unless lowlevel asserted for performing awrite opera- 
tion. Placed in high-impedance state in hold mode. This sig- 
nal also goes into high impedance when OFF is active low, 
and it is used in external DMA access of the 9K RAM cell. 
While HOLDA and IAQ are active low, this signal is used to 
indicate the direction of the data bus for DMA reads (high) 
and writes (low). 


Strobe signal. Always high unless asserted low toindicate an 
external bus cycle. Placed in high-impedance state in the 
hold mode. This signal also goes into high impedance when 
OFF is active low, and it is used in external DMA access of 
the 9K RAM cell or the 1K RAM cell on C51. While HOLDA 
and IAQ are active low, this signal is used to select the 
memory access. 


Read selectindicates an active, external read cycle and may 
connect directly to the output enable (OE) of external de- 
vices. This signal is active on all external program, data, and 
/O reads. Placed into high-impedance state in hold mode. 
This signal also goes into high impedance when OFF is ac- 
tive low. 


Write enable. The falling edge of this signal indicates that the 
device is driving the external data bus (D15—D0). Data may 
be latched by an external device on the rising edge of WE. 
This signal is active on all external program, data, and I/O 
writes. Placed into high-impedance state in hold mode. This 
signal also goes into high impedance when OF Fis active low. 
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Table 2-1. TMS320C5x Signal Descriptions (Continued) 
a ee eee 


Initialization, Interrupt, and Reset Operations 


41 External user interrupt inputs. Prioritized and maskable by the interrupt mask 
40 register and interrupt mode bit. Can be polled and reset via the interrupt flag 
=F register. 


Nonmaskable interrupt. External interrupt that cannotbe masked via the INTM 
or the IMR. When NMI is activated, the processor traps to the appropriate vec- 
tor location. 


127 


- ¥ 
_ CLKOUTI1 I 


CLKMD1 71 
CLKMD2 103 


X2/CLKIN1 iw 


Reset input. Causes the device to terminate execution and forces the program 
counter to zero. When RS is brought to a high level, execution begins at loca- 
tion zero of program memory. RS affects various registers and status bits. 


Microprocessor/Microcomputer mode select pin. If active low at reset (micro- 
computer mode), the pin causes the internal program ROM to be mapped into 
program memory space. In the microprocessor mode, all program memory is 
mapped externally. This pin is sampled only during reset, and the mode that 
is set at reset can be overridden via the software control bit MP/MC in the 
PMST register. 


Oscillator/Timer Signals 


Master clock output signal (CLKIN/2 or CLKIN2 frequency). This signal cycles 
at the machine-cycle rate of the CPU. The internal machine cycle is bounded 
by the rising edges of this signal. This signal also goes into high impedance 
when OFF is active low. 


CLKMDi CLKMD2 Clock Mode 


0 0 External clock with divide-by-two option. Input 
clock provided to X2/CLKIN1 pin. Internal oscilla- 
tor and PLL disabled. 


0 1 Reserved for test purposes. 


1 0 External divide-by-one option. Input clock pro- 
vided to CLKIN2. Internal oscillator disabled. 
Internal PLL enabled. 


Internal or external divide-by-two option. Input 
clock provided to X2/CLKIN1 pin. Internal oscilla- 
tor enabled. Internal PLL disabled. . 


Input pin to internal oscillator from the crystal. If the internal oscillator is not be- 
ing used, a clock may be input to the device on this pin. The internal machine 
cycle is half this clock rate. 


Output pin from the internal oscillator for the crystal. If the internal oscillator is 
notused, this pin should be left unconnected. This signal does not go into high 
impedance when OFF is active low. 
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Table 2-1. TMS320C5x Signal Descriptions (Continued) 
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Supply Pins (Concluded) 


Vssi4 | 102 | 8S | Ground for inputs and internal logic. 
Vgsis | 20 | S| Ground for inputs and internal logic. 
Vssi6 | i21f 6s] Ground for inputs and internal logic. 


Serial Port Signals 


CLKR 
TCLKR 


CLKX VO/Z 
TCLKX VO/Z 


DR 43 
TDR 44 


DX 106 O/Z 
TDX 107 


FSR I 
TFSR/TADD VO/Z 


Receive clock inputs. External clock signal for clocking data 
from the DR/TDR (data receive) pins into the RSR (serial port 
receive shift register). Must be present during serial port 
transfers. If the serial port is not being used, these pins can 
be sampled as an input via the INO bit of the SPC/TSPC reg- 
isters. 


Transmit clock. Clock signal for clocking data from the DR/ 
TDR (data receive register) to the DX/TDX (data transmit 
pin). The CLKX can be an input if the MCM bit in the serial 
port control register is set to 0. It may also be driven by the 
device at 1/4 the CLKOUT1 frequency when the MCM bit is 
set to 1. If the serial port is not being used, this pin can be 
sampled as an input via the IN1 bit ofthe SPC/TSPC register. 
This signal goes into high impedance when OFF is active 
low. 


Serial data receive inputs. Serial data is received in the RSR 
(serial port receive shift register) via the DR/TDR pin. 


Serial port transmit outputs. Serial data transmitted from the 
XSR (serial port transmit shift register) via the DX/TDX pin. 
Placed in high-impedance state when not transmitting and 
also when OFF is active low. 


Frame synchronization pulse for receive input. The falling 
edge of the FSR/TFSR pulse initiates the data receive pro- 
cess, beginning the clocking of the RSR. TFSR becomes an 
input/output (TADD) pin when the serial port is operating in 
TDM mode (TDM bit = 1). In TDM mode, this pin is used to 
output/input the address of the port. This signal goes into 
high impedance when OFF is active low. 
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Test Signals (Concluded) 


Emulator pin 1/disable all outputs. When TRST is driven low 
or not connected, this pin is configured as OFF. The EMU1/ 
OFF signal, when active low, puts all output drivers into the 
high-impedance state. Note that OFF is used exclusively for 
testing and emulation purposes (not for multiprocessing 
applications). When TRST is driven high, this pin is used as 
an interrupt to or from the emulator system and is defined as 
input/output via JTAG scan. 


RESERVED Reserved pin. These pins are reserved for future 


TMS320C5x devices. These pins should be left uncon- 
nected. 
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3.1. Architectural Overview 
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The TMS320C5x high-performance digital signal processors are designed, 
like the TMS320C25, with an advanced Harvard-type architecture that maxi- 
mizes the processing power by maintaining two separate memory bus struc- 
tures, program and data, for full-speed execution. Instructions support data 
transfers between the two spaces. 


The TMS320C5x performs twos-complement arithmetic, using the 32-bit ALU 
and accumulator. The ALU is a general-purpose arithmetic unit that operates 
by using 16-bit words taken from data memory or derived from immediate in- 
structions, or by using the 32-bit result from the multiplier. In addition to arith- 
metic operations, the ALU can perform Boolean operations. The accumulator 
stores the output from the ALU and is also the second input to the ALU. The 
accumulator is 32 bits in length and is divided into a high-order word (bits 31 
through 16) and a low-order word (bits 15 through 0). Instructions are provided 
for storing those high- and low-order accumulator words in memory. For fast, 
temporary storage of the accumulator, there is a 32-bit accumulator buffer. | 


In addition to the main ALU, there is a parallel logic unit (PLU) that executes 
logic operations on data without affecting the contents of the accumulator. The 
PLU provides the bit-manipulation ability required of a high-speed controller 
and simplifies the bit setting, clearing, and testing required with control and sta- 
tus register operations. 


The multiplier performs 16 x 16-bit twos-complement multiplication with a 
32-bit result in a single-instruction cycle. The multiplier consists of three ele- 
ments: multiplier array, PREG (product register), and TREGO (temporary reg- 
ister). The 16-bit TREGO temporarily stores the multiplicand; the PREG stores 
the 32-bit product. The multiplier’s values come from data memory, come from 
program memory when the MAC/MACD/MADS/MADD instructions are used, 
or are derived immediately from the multiply immediate instructions (MPY #). 
The fast on-chip multiplier allows the device to efficiently perform fundamental 
DSP operations such as convolution, correlation, and filtering. 


The TMS320C5x scaling shifter has a 16-bit input connected to the data bus 
and a 32-bit output connected to the ALU. The scaling shifter produces a left 
shift of 0 to 16 bits on the input data, as programmed in the instruction or de- 
fined in the shift count register (TREG1). The LSBs of the output are filled with 
zeros, while the MSBs may be either zero-filled or sign-extended, depending 
upon the state of the sign-extension mode bit (SXM) of status register ST1. Ad- 
ditional shift capabilities enable the processor to perform numerical-scaling, 
bit-extraction, extended-arithmetic, and overflow-prevention operations. 


Eight levels of hardware stack are provided for saving the contents of the pro- 
gram counter during interrupts and subroutine calls. On interrupts, the strate- 
gic registers (ACC, ACCB, ARCR, INDX, PMST, PREG, STO, ST1, TREGs) 
are pushed onto a one-deep stack and popped upon interrupt return, thus pro- 
viding a zero-overhead interrupt context switch. 
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Functional Block Diagram 
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Figure 3-1. Block Diagram of TMS320C5x Internal Hardware 
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Table 3-1. | TMS320C5x Internal Hardware (Continued) 


A 16-bit memory-mapped register containing the start address of the seg- 
ment of code being repeated. See subsection 3.6.5 for more details. 


A16-bit memory-mapped counter register used to limit the number of times 
the block is to be repeated. See subsection 3.6.5 for more details. 


A buffered interface used to pass data between the internal data and pro- 
gram buses. 


Bus Request This signal indicates that a data access is mapped to global memory space 
as defined by the GREG register. See Section 6.3 for more details. 


Carry This bit stores the carry output of the ALU. This bit resides in ST1. See sub- 
section 3.5.2 for more information. 


Central Arithmetic Logic Unit The grouping of the ALU, multiplier, accumulator, and scaling shifters. See 
Section 3.5 for more information. 


Circular Buffer An 8-bit register used to enable/disable the circular buffers and define 
Control Register which auxiliary registers are mapped to the circular buffers. See subsection 
3.4.3 for more information. 


Circular Buffer Two 16-bit registers indicating circular buffer end addresses. CBER1 and 
End Address CBER2 are associated with circular buffers one and two, respectively. See 
CBER2(16) | subsection 3.4.3 for more information. 


Circular Buffer CBSR(16) Two 16-bit registers indicating circular buffer start addresses. CBSR1 and 


Start Address CBSR1(16) | CBSR2 are associated with circular buffers one and two, respectively. See 
CBSR2(16) | subsection 3.4.3 for more information. 


the PC. See subsection 3.4.3 for more information. 


Configure Ram CNF This bit indicates whether on-chip dual-access RAM blocks are mapped to 
program or data space. The CNF bit resides in ST1. See subsection 3.6.3 
for more information. . 

DATA A 16-bit bus used to route data. 
Data Memory DATA This block refers to data memory used with the core and defined in specific 
MEMORY device descriptions. It refers to both on- and off-chip memory blocks in data | 

memory space. 

Data Memory _ DATA A 16-bit bus that carries the address for data memory accesses. 

Address Bus ADDRESS 


Data Memory Address DMA(7) A 7-bit register containing the immediate relative address within a 


Compare of Program COMPARE This circuit compares the current value in the PC to the value in PAER if 
Address BRAF is active. If the compare shows equal, then the PASR is loaded into 


i i 128-word data page. See subsection 3.4.2 for more information. 


Immediate Register 
Data Memory DP(9) A9Q-bit register containing the address of the current page. Data pages are 
Page Pointer 128 words each, resulting in 512 pages of addressable data memory space 


(some locations are reserved). See subsection 3.4.2 for more information. 


Data RAM Map Bit RAM(1) This bit indicates if the single-access RAM is mapped into data space. See 
subsection 3.6.3 for more information. 
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Table 3-1. | TMS320C5x Internal Hardware (Continued) 


Abus multiplexer used to select the source of operands for a bus or execu- 
tion unit, depending on the nature of the current instruction. 


Multiplexer MUX 


MULTIPLIER | A16x 16-bit parallel multiplier. See subsection 3.5.3 for more information. 


Overflow Flag OV(1) This bit resides in STO and indicates an overflow in an arithmetic operation 
in the ALU. See subsection 3.6.3 for more information. 


This bit resides in STO and determines whether an overflow inthe ALU will 


Overflow Mode OVM(1) 


Overlay to Data Space OVLY(1) 


wrap around or saturate. See subsection 3.6.3 for more information. 


This bit resides in the PMST register and determines whether the on-chip 
single-access memory will be addressable in data address space. See 
subsection 3.6.3 for more information. 


Parallel Logic Unit PLU 


A16-bit logic unit that executes logic operations from either long immediate 
operands or the contents of the DBMR directly upon data locations without 
interfering with the contents of the CALU registers. See Section 3.7 for 


more information. 


Prefetch Counter 


A 16-bit counter used to prefetch program instructions. The PFC contains 
the address of the instruction currently being prefetched. Itis updated when 
a new prefetch is initiated. The PFC can also address program memory 
when the block move (BLPD), multiply-accumulate (MAC/MACD), and 
table read/write (TBLR/TBLW) instructions are used and can address data 
memory when the block move (BLDD) instruction is used. 


Prescaler Count Register COUNT(4) A four-bit register that contains the value for the prescaling operation. 
When the register contents are used as prescaling data, this register is 
loaded from the dynamic shift count or from the instruction. In conjunction 
with the BIT and BITT instructions, this register is loaded from the dynamic 


bit pointer or the instruction word. 


A 32-bit product register used to hold the multiplier’s product. The high and 
low words of the PREG can be accessed individually. See subsection 3.5.3 
for more information. 


A 16-bit bus used to route instructions (and data for the MAC and MACD 
instructions). 


A 16-bit program counter used to address program memory sequentially. 
The PC always contains the address of the next instruction to be fetched. 
The PC contents are updated following each instruction decode operation. . 


Product Register PREG(32) 


Program Counter PC(16) 
This block refers to program memory used with the core and defined in spe- 
cific device descriptions. It refers to both on- and off-chip memory blocks 


Program Memory PROGRAM 
MEMORY 
accessed in program memory space. 
Program Memory PROG A 16-bit bus that carries the program memory address. 
Address Bus ADDRESS 


Prescaling Shifter PRESCALER | A0O- to 16-bit left barrel shifter used to prescale data coming into the ALU. 
Also used to align data for multiprecision operations. This shifter is also 
used as a 0- to 16-bit right barrel shifter of the ACC. See subsection 3.5.2 
for more information. 
Postscaling Shifter A 0- to 7-bit left barrel shifter used to postscale data coming out of the 
CALU. See subsection 3.5.2 for more information. 
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3.4 Internal Memory Organization 


This section describes the memory use of the TMS320C5x core and the ad- 
dressing modes supported by the core. 


3.4.1 Memory-Mapped Registers 


Twenty-eight core processor registers are mapped into the data memory 
space. These are listed in Table 3-2. An additional 64 memory-mapped regis- 
ters are reserved in page 0 of data space. These data memory locations are 
reserved for peripheral control registers, which are described in Chapter 5. 


Description 


Reserved 

Interrupt mask register 
Global memory allocation register 

Interrupt flag register 

Processor mode status register 

Repeat counter register 

Block repeat counter register 

Block repeat program address start register 
Block repeat program address end register 
Temporary register for multiplicand 
Temporary register for dynamic shift count 
Temporary register used as bit pointer 

in dynamic bit test 

Dynamic bit manipulation register 

Auxiliary register zero 

Auxiliary register one 

Auxiliary register two 

Auxiliary register three 

Auxiliary register four 

Auxiliary register five 

Auxiliary register six 

Auxiliary register seven 

Index register 

Auxiliary register compare register 

Circular buffer 1 start address register 
Circular buffer 1 end address register 
Circular buffer 2 start address register 
Circular buffer 2 end address register 
Circular buffer control register 

Block move address register 


Table 3-2. Memory-Mapped Registers 


wo 


MOOWPYPOONODOA ! 


Oe ee eee eee ee See ee ee Ce Co Se | 
TAaNMODOWDPOCMRINHHaONM—O 7 


3-10 Architecture 


Internal Memory Organization 


Figure 3-2. Direct Addressing Mode 


ADD 010h 
Machine Code = 0010 SHFETt lj001 oO000 
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DRAB = [1100 1110 1/001 0000 
Operand = Data(DRB) 


t SHFT represents a 4-bit shift value. 


Memory-mapped addressing mode operates much like direct addressing 
mode except that the most significant 9 bits of the address are forced to zero 
instead of being loaded with the contents of the DP. This allows the user to di- 
rectly address the memory-mapped registers of data page zero without the 
overhead of changing the DP or auxiliary register. Figure 3-3 illustrates 
memory-mapped addressing mode. . 


Figure 3-3. Memory-Mapped Addressing Mode 


LAMM PMST | 

MachineCode = [0000 1000 [0/000 011 1 

/ / 

/ / 

DP = 10000 0000 0 Fi / 

a ee / 

DRB = 0000 0000 0/000 0111 
Operand = Data(DRB) 
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In the case of the long immediate operand, the operand immediately follows 

the opcode in the program sequence. The long immediate operand is 16 bits 

long. Figure 3-6 shows an example of long immediate mode. In this example, 

the second word of the two-word instruction is added to the ACC by the CALU. 
Figure 3-6. Long Immediate Mode 


ADD = #01234h 


Machine Code 


0.0 0 1 0010 0011 0100 


Operand 


Operand = Data(second word(15 — 0)) 


The operand may come from a CPU register. This type of operand is used in 
special cases. The CALU uses this in multiplying with TREGO, in shifting with 
TREG1 and PM, and in bit manipulation with TREG2. The ARAU uses this with 
INDX and ARCR. The PLU uses this with DBMR. Figure 3-7 illustrates the use 
of the DBMR register as an AND mask in the APL instruction. 


Figure 3-7. Register Access Mode 


APL 010h 
Machine Code = 07101 1010 0} 0 0 1 000 0. 
/ / 
FA gf 
DP = 110 0 1110 1 7 7 
/ / 
\ i of 
DRB- = 1100 1110 1/0 O 1 0000 


Operand1 = Data DRB) 
Operand2 = DBMR 


‘Inthe long immediate addressing mode, an operand is addressed by the sec- 
ond word of a two-word instruction. In this case, the program address/data bus 
(PAB) is used for the operand fetch. The PC is stored in a temporary register, 
and the long immediate value is loaded into the PC. Then, the PAB is used for 
the operand fetch or write. At the completion of the instruction, the PC is re- 
stored from the temporary register, and execution continues. This technique 
is used when two memory addresses are required for the execution of the in- 
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Figure 8-9. Registered Block Memory Addressing Mode 
BLDD BMAR, 012h 


Machine Code 1 = 1010 1100 o0J001 0010 
/ 
/ ys 
pe = [1100 f110 1 ve y 
/ / 
\ Ma / 


DRB- = 1100 1110 1/0 0 1 0010 


Operand2 = Data (DRB) 
BMAF] __» 
Operand1 = Data (PC) 


3.4.3 Auxiliary Registers 


The TMS320C5x provides a register file containing eight auxiliary registers 
(ARO-—AR7). The auxiliary registers may be used for indirect addressing of the 

_data memory or for temporary data storage. Indirect auxiliary register address- 
ing (see Figure 3—10) allows placement of the data memory address of an in- 
struction operand into one of the auxiliary registers. These registers are 
pointed to by a three-bit auxiliary register pointer (ARP) that is loaded with a 
value from 0 through 7, designating ARO through AR7, respectively. The auxil- 
iary registers and the ARP may be loaded from data memory, the accumulator, 
the product register, or by animmediate operand defined in the instruction. The 
contents of these registers may also be stored in data memory or used as in- 
puts to the CALU. These registers appear in the memory map as described in 
Table 3-2. 
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Figure 3-11. Auxiliary Register File 
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The index register (INDX) can be added to or subtracted from AR(ARP) on any 
AR update cycle. This 16-bit register is one of the memory-mapped registers 
and is used to increment or decrement the address in steps larger than one, 
which is useful for operations such as addressing down a column of a matrix. 
The auxiliary register compare register (ARCR) is used as a limit to blocks of 
data and, in conjunction with the CMPR instruction, supports logical compari- 
sons between AR(ARP) and ARCR. The TMS320C25 uses ARO for these two 
functions. After reset, a LAR load of ARO also loads INDX and ARCR to main- 
tain compatibility with the TMS320C25. The splitting of functions to the three 
registers is enabled by setting the NDX bit of PMST to one. 


Because the auxiliary registers are memory-mapped, they can be acted upon 
directly by the CALU to provide for more advanced indirect addressing tech- 
niques. For example, the multiplier can be used to calculate the addresses of 
three-dimensional matrices. After a CALU load of the auxiliary register, there 
is, however, a two-instruction-cycle delay before auxiliary registers can be 
used for address generation. The INDX and ARCR registers are accessible via 
the CALU, regardless of the condition of the NDX bit (i.e., SAMM ARCR writes 
only to the ARCR). 


In addition to its use for address manipulation in parallel with other operations, 
the ARAU may also serve as an additional general-purpose arithmetic unit be- 
cause the auxiliary register file can directly communicate with data memory. 
The ARAU implements 16-bit unsigned arithmetic, whereas the CALU imple- 
ments 32-bit twos-complement arithmetic. The BANZ and BANZD instructions 
permit the auxiliary registers to be used as loop counters, also. 


The 3-bit auxiliary register pointer buffer (ARB), shown in Figure 3-11, provides 
storage for the ARP on subroutine calls when the automatic context switch 
compatibles of the device are not used. 


Two circular buffers can operate at a given time and are controlled via the circu- 
lar buffer control register (CBCR). The CBCR is defined as follows: 


Identifies which auxiliary register is mapped to circular buffer 1. 


Circular buffer 1 enable=1/disable=0. Set to 0 upon reset. 
Identifies which auxiliary register is mapped to circular buffer 2. 
Circular buffer 2 enable=1/disable=0. Set to 0 upon reset. 


Upon reset (RS rising edge), both circular buffers are disabled. To define a cir- 
cular buffer, load the CBSR1/2 with the start address of the buffer and 
CBER1/2 with the end address, and load the auxiliary register to be used with 
the buffer with an address between the start and end addresses. Finally, load 
CBCR with the appropriate auxiliary register number and set the enable bit. 
Note that the same auxiliary register can not be enabled for both circular buff- 
ers, or unexpected results will occur. As the address is stepping through the 
circular buffer, the auxiliary register value is compared against the value con- 
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The TMS320C5x central arithmetic logic unit (CALU) contains a 16-bit scaling 
shifter, a-16 x 16-bit parallel multiplier, a 32-bit arithmetic logic unit (ALU), a 
32-bit accumulator (ACC), a 32-bit accumulator buffer (ACCB), and additional 
shifters at the outputs of both the accumulator and the multiplier. This section 
describes the CALU components and their functions. Figure 3—12 is a block 
diagram showing the components of the CALU. The following steps occur in 
the implementation of a typical ALU instruction: 


1) Data is fetched from the RAM on the data bus, 


2) Datais passed through the scaling shifter and the ALU where the arithme- 
tic is performed, and 


3) The result is moved into the accumulator. 


One input to the ALU is always provided by the accumulator. The other input 
may be transferred from the product register (PREG) of the multiplier, the accu- 
mulator buffer (ACCB), or the scaling shifter that is loaded from data memory 
or the accumulator (ACC). 
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3.5.1 Prescaling Shifter | 


The TMS320C5x provides a scaling shifter that has a 16-bit input connected 
to the data bus and a 32-bit output connected to the ALU; see Figure 3-12. The 
scaling shifter produces a left shift of 0 to 16 bits on the input data. The shift 
countis specified by a constant embedded in the instruction word or by the val- 
ue in TREG1. The LSBs of the output are filled with zeros; the MSBs may be 
either filled with zeroes or sign-extended, depending upon the value of the 
SXM bit (sign-extension mode) of status register ST1. 


The TMS320C5x also contains several other shifters that allow it to perform 
numerical scaling, bit extraction, extended-precision arithmetic, and overflow 
prevention. These shifters are connected to the output of the product register 
and the accumulator. 


3.5.2 ALU and Accumulator 


The TMS320C5x 32-bit ALU and accumulator implement a wide range of arith- 
metic and logical functions, the majority of which execute in a single clock 
cycle. Once an operation is performed in the ALU, the result is transferred to 
the accumulator where additional operations, such as shifting, may occur. Data 
that is input to the ALU may be scaled by the prescaling shifter. 


The ALU is a general-purpose arithmetic/logic unit that operates on 16-bit 
words taken from data RAM or derived from immediate instructions. In addition 
to the usual arithmetic instructions, the ALU can perform Boolean operations, 
facilitating the bit manipulation ability required of a high-speed controller. One 
input to the ALU is always supplied by the accumulator, and the other input may 
be furnished from the product register (PREG) of the multiplier, the accumula- 
tor buffer (ACCB), or the output of the scaling shifter (that has been read from 
data RAM or from the ACC). After the ALU has performed the arithmetic or log- 
ical operation, the result is stored in the accumulator. For the following.exam- 
ple, assume ACC = 0, PREG = 000222200h, PM = 00,-and ACCB = 
000333300h: 


LACC #01111h,8 7;ACC = 00111100. Load ACC from pre- 
;scaling shifter. 


APAC 7ACC = 00333300. Add to ACC the 
;product register. 
ADDB 7;ACC = 00666600. Add to ACC the 


saccumulator buffer. 


The 32-bit accumulator (ACC) can be split into two 16-bit segments for storage 
in data memory; see Figure 3-12. Shifters at the output of the accumulator pro- 
vide a left shift of 0 to 7 places. This shift is performed while the data is being 
transferred to the data bus for storage. The contents of the accumulator remain 
unchanged. When the postscaling shifter is used on the high word of the accu- 
mulator (bits 16-31), the MSBs are lost and the LSBs are filled with bits shifted 
in from the low word (bits 0 — 15). When the postscaling shifter is used on the 
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Central Arithmetic Logic Unit (CALU) 
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more efficient computation of extended-precision products and additions or 
subtractions. It is quite useful in overflow management. The carry bitis affected 
by most arithmetic instructions as well as the single-bit shift and rotate instruc- 
tions. It is not affected by loading the accumulator, logical operations, or other 
such non-arithmetic or control instructions. Examples of carry bit operations 
are shown in Figure 3-13. 


Figure 3-13. Examples of Carry Bit Operations 
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C MSB LSB C MSB LSB 

X FFFFFFFFACC X 00000000acc 
ees eee ae eee! 1 EE Sk ec a ee a 

1 00000000 0 FFFFFFFF 

C MSB LSB C MSB LSB 

X 7FFFFFFFACC X 80000001 acc 

+ = - = 

0 80000000 1 7FFFFFFF 

C MSB LSB C MSB LSB 

1 00000000acc 0 FFFFFFFFACC 

0 00000001 1 FFFFFFFd 


Shown in the examples of Figure 3-13, the value added to or subtracted from 
the accumulator may come from the input scaling shifter, ACCB, or PREG. The 
carry bit is set if the result of an addition or accumulation process generates 
acarry; it is reset to zero if the result of a subtraction generates a borrow. 
Otherwise, it is cleared after an addition or set after a subtraction. 


The ADDC (add to accumulator with carry) and SUBB (subtract from accumu- 
lator with borrow) instructions use the previous value of carry in their addition/ 
subtraction operation. The ADCB (add ACCB to accumulator with carry) and 
the SBBB (subtract ACCB from accumulator with borrow) also use the pre- 
vious value of carry. 


The one exception to operation of a carry bit, as shown in Figure 3-13, is in 
the use of ADD with a shift count of 16 (add to high accumulator) and SUB with 
a shift count of 16 (subtract from high accumulator). This case of the ADD in- 
struction can set the carry bit only if a carry is generated, and this case of the 
SUB instruction can reset the carry bit only if a borrow is generated; otherwise, 
neither instruction affects it. 


Two conditional operands, C and NC, are provided for branching, calling, re- 
turning, and conditionally executing according to the status of the carry bit. The 
CLRC, LST #1, and SETC instructions can also be used to load the carry bit. 
The carry bit is set to one on a hardware reset. 


The SFL and SFR (in-place one-bit shift to the left/right) instructions and the 
ROL and ROR (rotate to the left/right) instructions shift or rotate the contents 
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fractional arithmetic, or justifying fractional products. The PM field of status 
register ST1 specifies the PM shift mode, as shown in Table 3-3. 


Table 3-3. Product Shift Modes 


[Pa | Resuting Sait] 
00 | Noshin 
[or | totsnitorren = 
[10 [_Letshitofatits 


Right shift of 6 bits 


The productis shifted one bit to compensate for the extra sign bit gained in mul- 
tiplying two 16-bit twos-complement numbers (MPY). The four-bit shift is used 
in conjunction with the MPY instruction with a short immediate value (13 bits 
orless) to eliminate the four extra sign bits gained in multiplying a16-bit number 
times a 13-bit number. The output of PREG can, instead, be right-shifted 6 bits 
to enable the execution of up to 128 consecutive multiply/accumulates without 
the possibility of overflow. Note that, when the right shift is specified, the prod- 
uct is always sign-extended, regardless of the value of SXM. 


The LT (load TREGO) instruction normally loads TREGO to provide one oper- 
and (from the data bus), and the MPY (multiply) instruction provides the sec- 
ond operand (also from the data bus). A multiplication can also be performed 
with a short or long immediate operand by using the MPY instruction with an 
immediate operand. A product can be obtained every two cycles except when 
a long immediate operand is used. 


Four multiply/accumulate instructions (MAC, MACD, MADD, and MADS) fully 
utilize the computational bandwidth of the multiplier, allowing both operands 
to be processed simultaneously.The data for these operations can be trans- 
ferred to the multiplier each cycle via the program and data buses. This facili- 
tates single-cycle multiply/accumulates when used with repeat (RPT and 
RPTZ) instructions. In these instructions, the coefficient addresses are gener- 
ated by the PC, while the data addresses are generated by the ARAU. This al- 
lows the repeated instruction to sequentially access the values from the coeffi- 
cient table and step through the data in any of the indirect addressing modes. 
The RPTZ instruction also clears the accumulator and the product register to 
initialize the multiply/accumulate operation. As an example, consider multiply- 
ing the row of one matrix times the column of asecond matrix. For this example, 
consider 10 x 10 matrices, MTRX1 points to the beginning of the first matrix, 
INDX = 10, and AR(ARP) points to the beginning of the second matrix: 


RPTZ #9 7For i= 0, i< 10, it=. 

MAC MTRX1,*0+ ;PREG = DATA(MTRX1 + i) x DATA[MTRX2 + (i x INDX)]. 
3ACC += PREG. 

APAC ;ACC += PREG. 


The MAC and MACD instructions obtain their coefficient pointer from a long 
immediate address and are, therefore, two-word instructions. The MADS and 
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3.6 System Control 


System control on the TMS320C5x is provided by the program counter, hard- 
ware stack, PC-related hardware, external reset signal, interrupts (see Section 
3.8), status registers, and repeat counters. The following subsections describe 
the function of each of these components in system control and pipeline opera- 
tion. 


3.6.1 Program Address Generation and Control 


The TMS320C5x has a 16-bit program counter (PC) and an eight-deep hard- 
ware stack for PC storage. The program counter addresses internal and exter- 
nal program memory in fetching instructions. The stack is used during inter- 
rupts and subroutines. 


The program counter addresses program memory, either on-chip or off-chip, 
via the program address bus (PAB). Through the PAB, an instruction is ad- 
dressed in program memory and loaded into the instruction register (IREG). 
When the IREG is loaded, the PC is ready to start the next instruction fetch 
cycle. 


The PC can be loaded in a number of ways. When code is sequentially ex- 
ecuted, the PC is loaded with PC + 1. When a branch is executed, the PC is 
loaded with the long immediate value directly following the branch instruction. 
In the case of a subroutine call, the PC+2 is pushed onto the stack and then 
loaded with the long immediate value directly following the call instruction. The 
return instructions pop the stack back into the PC to return to the calling or inter- 
rupting sequence of code. In the case of a software trap or interrupt trap, the 
PC is loaded with the address of the appropriate trap vector. The contents of 
the accumulator may be loaded into the PC in order to implement computed 
GOTO operations. This can be accomplished with the BACC (branch to ad- 
dress in accumulator) or CALA (call subroutine at location specified by ACC) 
instructions. 


The PAB bus can also address data stored in either program or data space. 
This makes it possible, in repeated instructions, to fetch a second operand in 
parallel with the data bus for two-operand operations. When repeated, the 
array addressed by the PAB is sequentially accessed via the incrementing of 
the PC. The block transfer instructions (BLDD, BLDP, and BLPD) use both 
buses so that, when repeated, the pipeline structure can be reading the next 
operand while writing the current one. The BLPD instruction loads the PC with 
either the long immediate address following the BLPD or with the contents of 
the block move address register (BMAR). The PAB bus is then used to fetch 
the source data from program space in this block move operation. The BLDP 
executes much the same except that the PAB bus is used for the destination 
operation. The BLDD instruction uses the PAB bus to address data space. 


The TBLR and TBLW instructions operate much like the BLPD and BLDP in- 
structions, respectively, except that the PC is loaded with the low 16 bits of the 
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the PC is loaded with the second word and the core CPU starts refilling the 
pipeline with instructions at the branch address. Because the pipeline has 
been flushed, the branch instruction has an effective execution time of four 
cycles if the branch is taken. If, however, any of the conditions are not met, the 
pipeline controller allows the next instruction (already fetched) to be decoded. 
This means that if the branch is not taken, the effective execution time of the 
branch is two cycles. 


The subroutine call can also be executed conditionally. The CC instruction op- 
erates like the BOND except that the PC pointing to the instruction following 
the CC is pushed onto the PC stack. This sets up the return (by RET) to pop 
the stack to return to the calling sequence. A subroutine or function can have 
multiple return paths based upon the data being processed. Using conditional 
returns (RETC) avoids the need for conditionally branching around the return. 
For example, 


CC OVER_FLOW, OV ;If overflow,then execute the 
. ;overflow-handling routine. 


OVER_FLOW ;Overflow-handling routine. 
RETC GEQ 7;If ACC >= 0, then return. 
RET #Return. 


Inthe example, an overflow-handling subroutine is called if the main algorithm 
causes an overflow condition. During the subroutine, the ACC is checked and, 
if it is positive, the subroutine returns to the calling sequence. If not, additional 
processing is necessary before the return. Note that RETC, like RET, is a 
single-word instruction. However, because of the potential PC discontinuity, it 
still operates with the same effective execution time as BCND and CC. 


To avoid flushing the pipeline and causing extra cycles, the TMS320C5x has 
a full set of delayed branches, calls, and returns. In the delayed operation of 
branches, calls, or returns, the two-instruction words following the delayed in- 
struction are executed while the instructions at and following the branch ad- 
dress are being fetched—therefore, giving an effective two-cycle branch in- 
stead of flushing the pipeline. If the instruction following the delayed branch is 
a two-word instruction, then only it will be executed. For example, 


OPL #030h,PMST 
BCND NEW_ADRS,EQ 


or 


BCNDD NEW_ADRS,EQ 
OPL #030h,PMST. 
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to the XC and after the ADD so that the SPLK will not execute. In the second 
code segment, TEMP2is notset to EEEE. The NEQ status, caused by the ADD 
instruction, is established one full cycle before the XC execution phase be- 
cause the long immediate value (#01234h) used in the ADD caused it tobe a 
two-cycle instruction. Since the condition is not met, a NOP is forced over both 
words of the two-word SPLK instruction, and, therefore, TEMP2is not affected. 
Note that interrupts will have no effect on this instruction sequence. 


The TMS320C5x also has a feature that allows the execution of a single in- 
struction N + 1 times where N is the value loaded in a 16-bit repeat counter 
(RPTC). If the repeat feature is used, the instruction is executed and the RPTC 
is decremented until the RPTC goes to zero. This feature is useful with many 
instructions, such as NORM (normalize contents of accumulator), MACD (mul- 
tiply and accumulate with data move), and SUBC (conditional subtract). As in- 
structions repeat, the program address and data buses are freed to fetch a 
second operand in parallel with the data address and data buses. This allows 
instructions such as MACD and BLPD to effectively execute in a single cycle 
when they repeat. See Section 7.6, Single Instruction Repeat Loops, for details 
on these instructions. 


The stack is 16 bits wide and eight levels deep. The PC stack is accessible 
through the use of the PUSH and POP instructions. Whenever the contents of 
the PC are pushed onto the top of the stack, the previous contents of each level 
are pushed down, and the bottom (eighth) location of the stack is lost. There- 
fore, data will be lost if more than eight successive pushes occur before a pop. 
The reverse happens on pop operations. Any pop after seven sequential pops 
yields the value at the bottom stack level, and all of the stack levels then contain 
the same value. Two additional instructions, PSHD and POPD, push a data 
memory value onto the stack or pop a value from the stack to data memory. 
These instructions allow a stack to be built in data memory for the nesting of 
subroutines/interrupts beyond eight levels. See Section 7.3, Software Stack, 
for details on software stack. 


3.6.2 Pipeline Operation 
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Instruction pipelining consists of the sequence of bus operations that occur 
during instruction execution. In the operation of the pipeline, the instruction 
fetch, decode, operand fetch, and execute operations are independent, which 
allows overall instruction executions to overlap. Thus, during any given cycle, 
one to four different instructions can be active, each at a different stage of com- 
pletion, resulting in a four-deep pipeline. Figure 3-14 shows the operation of 
the four-level pipeline for single-word single-cycle instructions executing with 
no wait states. The pipeline is essentially invisible to the user except in some 
cases, such as auxiliary register updates, memory-mapped accesses of the 
CPU registers, the NORM instruction, and memory configuration commands. 
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or 


EXAM3 LAR AR2,#067h ;AR2 = 67. 
LACC #064h 7;ACC = 00000064. 
SAMM AR2 ;AR2 = 64, 
NOP ;Pipeline protection. 
NOP ;Pipeline protection. 
LACC *= ;AR2 = 63. 
ADD *— ;AR2 = 62. 


In EXAM1, the decode phase of the ADD instruction is on the same cycle as 
the execute (write) phase of the SAMM instruction. Both of these instructions 
are trying to load AR2. The ADD *— update does load AR2, while the SAMM 
execution is voided. In EXAM2, a NOP is strategically placed to avoid the con- 
flict between the ADD *— update of the AR2 and the SAMM write to AR2. In this 
code’s sequence, 


AR2 = 67 —> 66 —> 64— 63 


Note that the LACC address is based on the value in AR2 before the SAMM 
write to AR2. In EXAM3, the SAMM write to AR2 is completed before either the 
LACC or the ADD have updated AR2. Any two instruction words that do not 
update AR2 can be used in place of the two NOP instructions. This could be 
two one-word instructions or one two-word instruction. The results obtained by 
EXAM1 and EXAM2 code examples may be different if the code is interruptible. 
The user should avoid writing code similar to EXAM1 and EXAM2.  ~ 


The pipeline effect described above requires writes to memory-mapped regis- 
ters to allow for a latency between the write and an access of that register. 
These registers can be accessed by TMS320C5x instructions in the decode 
and operand fetch phases of the pipeline. Table 3—4 outlines the latency re- 
quired between an instruction that writes the register and the access of that 
register. 
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3.6.3 Status and Control Registers 


There are four key status and control registers for the TMS320C5x core. STO 
and ST1 contain the status of various conditions and modes compatible with 
the TMS320C25, while PMST and CBCR contain extra status and control infor- 
mation for control of the enhanced features of the TMS320C5x core. These 
registers can be stored into data memory and loaded from data memory, thus 
allowing the status of the machine to be saved and restored for subroutines. 
STO, ST1, and PMST each have an associated one-deep stack for automatic 
context-saving when an interrupt trap is taken. The stack is automatically 
popped upon a return from interrupt. Note that the XF bit in ST1 is not saved 
on the one-deep stack or restored from that stack on an automatic context 
save. This feature allows the XF pin to be toggled in an interrupt service routine 
while still allowing automatic context saves. 


The PMST and CBCR registers reside in the memory-mapped register space 
in page zero of data memory space. Therefore, they can be acted upon directly 
by the CALU and the PLU. They can be saved in the same way as any other 
data memory location. Note that the CALU and the PLU operations change the 
bits of these status registers during the execute phase of the pipeline. The next 
two instruction words, following an update of these status registers, may not 
be affected by the reconfiguration caused by the status update. 


The LST instruction writes to STO and ST1, and the SST instruction reads from 
them, except that the INTM bit is not affected by the LST instruction. Unlike the 
PMST and CBCR registers, the STO and ST1 registers do not reside in the 
memory map and, therefore, cannot be handled by using the PLU instructions. 
The individual bits of these registers can be set or cleared with the SETC and 
CLRC instructions. For example, the sign-extension mode is set with SETC 
SXM or cleared with CLRC SXM. 


Figure 3~15 shows the organization of the four status registers, indicating all 
status bits contained in each. Several bits in the status registers are reserved 
and read as logic ones. Table 3-5 defines all the status/control bits. 
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Table 3-5. Status Register Field Definitions (Continued) 


3-40 


Carry Bit. This bit is set to 1 if the result of an addition generates a carry, or is reset to 0 if the result of 
a subtraction generates a borrow. Otherwise, it is reset after an addition or is set after a subtraction, 


‘| unless the instruction is ADD or SUB with a 16-bit shift. In these cases, the ADD can only set and the 


SUB only reset the carry bit, but they cannot affect it otherwise. The single-bit shift and rotate instruc- 
tions, as well as the SETC, CLRC, and LST #1 instructions also affect this bit. C is set to 1 on a reset. 


Circular Buffer 1 Auxiliary Register. These three bits identify which auxiliary register is assigned to circu- 
lar buffer 1. 


Circular Buffer 2 Auxiliary Register. These three bits identify which auxiliary register is assigned to circu- 
lar buffer 2. 


Circular Buffer 1 Enable. This bit, when setto 1, enables circular buffer 1. When CENB‘1 is setto 0, circu- 
lar buffer 1 is disabled. CENB1 is set to zero upon reset. 


Circular Buffer 2 Enable. This bit, when setto 1, enables circular buffer2. When CENBZ2is set to 0, circu- 
lar buffer 2 is disabled. CENB2 is set to zero upon reset. 


On-chip RAM Configuration Control Bit. If this bit is set to 0, the reconfigurable-data dual-access RAM 
blocks are mapped to data space; otherwise, they are mapped to program space. The CNF may be 
modified by the SETC CNF, CLRC CNF, and LST #1 instructions. RS sets the CNF to 0. 


Data Memory Page Pointer. The 9-bit DP register is concatenated with the 7 LSBs of an instruction word 
to form a direct memory address of .16 bits. DP may be modified by the LST and LDP instructions. 


Hold Mode Bit. When HM = 1, the processor halts internal execution when acknowledging an active 
HOLD. When HM = 0, the processor may continue execution out of internal program memory but puts 
its external interface in a high-impedance state. This bit is set to 1 by reset. 


Interrupt Mode Bit. When this bit is set to 0, all unmasked interrupts are enabled. When it is set to 1, 
all maskable interrupts are disabled. INTM is set and is reset by the SETC INTM and CLRC INTM in- 
structions. RS and IACK also set INTM. INTM has no effect on the unmaskable RS and NMI interrupts. 
Note that INTM is unaffected by the LST instruction. This bit is set to 1 by reset. It is also set to 1 when 
a maskable interrupt trap is taken. itis reset to 0 when a RETE (return from interrupt with interrupt en- 
able) is executed. 


Interrupt Vector Pointer. These five bits point to the 2K page where the interrupt vectors reside. This 
allows the user to remap the interrupt vectors to RAM for boot-loaded operations. At reset, these bits 
are all set to zero. Therefore, the reset vector always resides at zero in the program memory space. 


Microprocessor/Microcomputer Bit. When this bit is set to zero, the on-chip ROM is enabled. When it 
is set to one, the on-chip ROM is not addressable. This bit is set to the value corresponding to the logic 
level on the MP/MC pin at reset. The level on the MP/MC pin is sampled at device reset only and can 
have no effect until the next reset. 


Enable Extra Index Register. This bit configures indexed indirect addressing and auxiliary address reg- 
ister compare to operate either ina TMS320C2x-compatible mode (NDX = 0) or ina TMS320C5x-en- 
hanced mode (NDX = 1). When NDX = 0, the LAR ARO instruction loads the INDX and ARCR registers 
in addition to ARO. This is because the TMS320C2x devices use ARO for indexing and AR compare 
operations. When NDX = 1, INDX and ARCR are not affected by the LAR instruction. NDX = 0 at reset. 


Overflow Flag Bit. As a latched overflow signal, OV is set to 1 when overflow occurs in the ALU. Once 
an overflow occurs, the OV remains set until a reset, BCND(D) on OV/NOV, or LST instruction clears 
OV. 
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Table 3-6. On-Chip RAMT Configuration Control 


[—pevies | ovi_Raw [Configuration] 
[fusecocso [0 0 | _Onchip aK RAMiedsabied =| 


TMS320C50 On-chip 9K RAM is in both program and data 
spaces 


1 1 i 
TMS320C51 On-chip 1K RAM is disabled 
TMS320C51 0 1 On-chip 1K RAM is mapped into program space 
TMS320C51 On-chip 1K RAM is mapped into data space 


TMS320C51 On-chip 1K RAM is in both program and data 
spaces 


t Excluding on-chip dual-access RAM blocks. 


3.6.4 Repeat Counter 


RPTC is a 16-bit repeat counter, which, when loaded with anumber N, causes 
the next single instruction to be executed N + 1 times. The RPTC register is 
loaded by either the RPT or the RPTZ instruction. This results in a maximum 
of 65,536 executions of a given instruction. RPTC is cleared by reset. The 
RPTZ instruction clears both ACC and PREG before the next instruction starts 
repeating. Once a repeat instruction (RPT or RPTZ) is decoded, all interrupts 
(except reset) are masked until the completion of the repeat loop. The RPTC 
register resides in the CPU’s memory-mapped register space; however, you 
should avoid writing to this register. 


The repeat function can be used with instructions such as multiply/accumu- 
lates (MAC and MACD), block moves (BLDD and BLPD), I/O transfers (IN/ 
OUT), and table read/writes (TBLR/TBLW). These instructions, although nor- 
mally multicycle, are pipelined when the repeat feature is used, and they effec- 
tively become single-cycle instructions. For example, the table read instruction 
may take three or more cycles to execute, but when the instruction is repeated, 
a table location can be read every cycle. Note that not all instructions can be 
repeated. Table 3-7 lists all of the TMS320C5x instructions, segregated ac- 
cording to their repeatability. 
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Table 3-7a. Repeatable Instructions (Concluded) | 
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_ | ROLB Rotate combined ACC and ACCB left once 


Repeatable Instructions Description 
- PUSH Push low ACC to the PC stack 
RO Rotate ACC left once 
B 


[sac Storeiow ACC wihshit SSCS 


| 


SPAC - s:Subtract PREG from ACC 


L 
R 
SAMM Store low ACC direct/indirect to data page 0 

SAR AR," Store AR indirect addressed . 
SPH ;Store high PREG to direct/indirect addressed 
SPL 
T 


SATH Shift ACC right 0 or 16 bits as specified by TREG1(4) 


SBBB Subtract ACCB from ACC with borrow 


XPL :XOR DBMR to direct/indirect addressed 
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Instructions Not Meaningful 
to Repeat 


Table 3-7c. _Nonrepeatable Instructions 


Description 
sAdd to ACC short immediate 


ADD #k, shift ;Add to ACC long immediate with shift 
ADRK ;Add to AR short immediate 
AND #1k, shft sAND to ACC long immediate with shift 


;Compare long immediate to direct/indirect addressed 
IDLE sIdle CPU 


IDLE2 sIdle until interrupt — low power mode 
INTR ;Soft interrupt 
LACC _ #ik,shft , sLoad ACC long immediate 
LACL = #k ;Load ACC short immediate 
;Load AR with long immediate 
;Load DP short immediate 


;Non-maskable interrupt 

;OR long immediate to direct/indirect addressed 
;OR to ACC long immediate with shift 

;Return [delayed] from subroutine conditionally 
;Return from subroutine 
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SPLK #010h,BRCR ;Set loop count to 16. 
RPTB END_LOOP-1 7For I = BRCR; I > =0; I- -. 


ZAP ;ACC = PREG = 0. 
SQRA *,AR2 ;PREG = X2, 
SPL SORX ;Save x2, 
MPY * ;PREG = b x X. 
LTA SQRX ;ACC = bX. TREG = X2, 
MPY * ;PREG = aX?2. 
APAC ;ACC = aX? + bX. 
ADD *,0,AR3 ;ACC = aX2 + bX + c= Y. 
SACL *,0,AR1 ;Save Y. 
CRGT ;Save MAX. 
END_LOOP 


The example implements 16 executions of Y = aX2 + bX + cand saves the max- 
imum value in ACCB. Note that the initialization of the auxiliary registers is not 
shown in the coded example. PAER is loaded with the address of the last word 
in the code segment. The label END_LOOP is placed after the last instruction, 
and the RPTB instruction long immediate is defined as END_LOOP-—1 in case 
the last word in the loop is a two-word instruction. 


There is only one set of block repeat registers, so multiple block repeats cannot 
be nested without saving the context of the outside block or using BANZD. The 
simplest method of executing nested loops is to use the RPTB for only the in- 
nermost loop and using BANZD for all the outer loops. This is still a valuable 
cycle-saving operation because the innermost loop is repeated significantly 
more times than the outer loops. Block repeats can be nested by storing the 
context of the outer loop before initiating the inner loop, then restoring the outer 
loop’s context after completing the inner loop. The context save and restore 
are shown in the following example: 


SMMR BRCR, TEMP1 7;Save block repeat counter. 

SMMR PASR, TEMP2 ;Save block start address. 

SMMR PAER, TEMP3 ;Save block end address. 

SPLK #NUM_LOOP,BRCR ;Set inner loop count. 

RPTB END_ INNER 7;For I = 0; I<=BRCR; I++. 
END_INNER 

OPL #1,PMST ;Set BRAF to continue outer loop. 

LMMR BRCR, TEMP1 7;Restore block repeat counter. 

LMMR PASR, TEMP2 7;Restore block start address. 

LMMR PAER, TEMP3 ;Restore block end address. 


In this example, the context save and restore operations take 14 cycles. Note 
that repeated single and BANZ/BANZD loops can also be inside a block re- 


‘peat. The repeated code can include subroutine calls. Upon returning, the 


block repeat resumes. Repeated blocks can be interrupted. When an enabled 
interrupt occurs during a repeated block of code, the CALU traps to the inter- 
rupt and, when the ISR returns, the block repeat resumes. 
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of the last word of the table. Notice that the label marking the end of the loop 
is placed after the last instruction, then the PAER is loaded with that label, mi- 
nus 1. It is possible to place the label before the CALA instruction, then load 
the PAER with the label address because this is a one-word instruction. How- 
ever, if the last instruction in this loop had been a two-word instruction, the sec- 
ond word of the instruction would not be read, and the long immediate operand 
would be substituted with the first instruction in the loop. 


Inside the loop, the pointer to the task table is incremented and saved. Then, 
the task address is read from the table and loaded into the accumulator. Next, 
the task is called by the CALA instruction. Notice that, when the task returns 
to the task handler, it returns to the top of the loop. This is because the PC has 
already been loaded with the PASR before the CALA executes the PC disconti- 
nuity. Therefore, when the CALA is executed, the address of the top of the loop 
is pushed onto the PC stack. 


The last two words of a repeat-block loop are not interruptible. In other words, 
the interrupt path will not be taken while the last two instruction words of a re- 
peat block are being fetched. 


Example 3-1. Interrupt Operation With a Single-Word Instruction at the End of an RPTB 


RPTB END_LOOP-1 


SAR ARO, * < interrupt path taken here 
if not the last loop iteration 


SACL * <~ interrupt occurs here 
ENDLOOP : 
MAR *,AR1 < Interrupt path taken here if interrupt 


occurs during last two instruction words 
of the last loop iteration 


Example 3-2. Interrupt Operation With a Two-Word Instruction at the End of an RPTB 
RPTB END_LOOP-1 


SAR ARO, * <- interrupt path taken here 
if not the last loop iteration 


LACC e+ 


SPLK #1234h,* ¢ interrupt occurs here 
ENDLOOP : 
MAR *, ARL < Interrupt path taken here if interrupt 


occurs during last two instruction words 
of the last loop iteration 


Note that any incoming interrupt will be latched by the TMS320C5x as soon 
as it meets the interrupt timing requirement. However, the PC will not branch 
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3.7 Parallel Logic Unit (PLU) 


The parallel logic unit (PLU) can directly set, clear, test, or toggle multiple bits 
in acontrol/status register or any data memory location. The PLU, shownin » 
the block diagram in Figure 3-16, provides a direct logic operation path to data 
memory values without affecting the contents of the accumulator or product 
register. It can be used to set or clear multiple bits in a control register or to test 
multiple bits in a flag register. 


Figure 3-16. Parallel Logic Unit Block Diagram 
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The PLU executes a read-modify-write operation on data stored in data space. 

The PLU operation begins with the fetching of one operand from data memory 
space and the fetching of the second from either long immediate on the pro- 
gram bus or the dynamic bit manipulation register (DBMR). Then, the PLU ex- 
ecutes a logical operation defined by the instruction on the two operands. The 
result is written to the same data memory location from which the first operand 
was fetched. 


The PLU allows the direct manipulation of bits in any location in data memory 
space. This direct bit manipulation is done by ANDing, ORing, XORing, orload- 
ing a 16-bit long immediate value to a data location. For example, to use AR1 
for circular buffer 1 and AR2 for circular buffer 2 but not enable the circular buff- 
ers, initialize the circular buffer control register (CBCR) by executing this: 


SPLK #021h,CBCR ;Store peripheral long immediate.; (DP = 0). 
To later enable circular buffers 1 and 2, execute 
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3.8.1 
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Interrupts 


Reset 


The TMS320C5x core CPU supports sixteen user-maskable interrupts 
(INT16-INT1). However, each TMS320C5x DSP does not necessarily use all 
16. For example, the TMS320C50 and TMS320C51 use only nine of these in- 
terrupts (the others are tied high inside the device). Interrupts can be gener- 
ated by the serial ports (RINT and XINT), by the timer (TINT), and by the soft- 


ware interrupt (TRAP) instruction. The reset (RS) interrupt has the highest 
priority, and the INT16 interrupt has the lowest priority. 


Reset (RS) is a nonmaskable external interrupt that can be used at any time 
to put the TMS320C5x into a known state. Reset is typically applied after pow- 
er-up when the machine is in an unknown state. 


Driving the RS signal low causes the TMS320C5x to terminate execution and 
forces the program counter to zero. RS affects various registers and status 
bits. At power-up, the state of the processor is undefined. For correct system 
operation after power-up, a reset signal must be asserted low for one full clock 
cycle. The device will latch the reset pulse and generate an internal reset pulse 
of five cycles, long enough to guarantee a reset of the device. Processor ex- 
ecution begins at location 0, which normally contains a branch instruction to 
the system initialization routine. 


When the RS signal is received, the following actions occur: 


1) Alogic0 is loaded into the CNF (configuration control) bitin status register 
ST1, mapping dual-access RAM block 0 into data address space. 


2) The program counter (PC) is set to 0. The address bus (lines A15 — AO) 
is unknown while RS is low, unless the HOLD input of the device is low. 
In this case, the address lines are placed into a high-impedance state until 
HOLD is brought back high. 


3) All interrupts are disabled by setting the INTM bit (interrupt mode) to 1; 
note that RS is nonmaskable. The interrupt flag register (IFR) is cleared. 


4) Status bits are set as follows: 
0-—OV, 1—XF, 1—SXM, 0PM, 1—HM, 0—- BRAF, 
0—>TRM, 0 NDX, 0—>~CENB1, 0->CENB2, 0—-IPTR, 
0- OVLY, O-— AVIS, 0— RAM, O-— BIG, 0-—-CNF, 
1—INTM, MP/MC (Pin) > PMST (MP/MC), and 1—>C, 


Note that the remaining status bits remain undefined and should be initial- 
ized appropriately. 


5) The global memory allocation register (GREG) is cleared to make all 
memory local. 
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Table 3-8. Interrupt Locations and Priorities 


eee ee 


asf oo | a igh 


INT1 bigs sl user interrupt #1 
INT2 a oes user interrupt #2 : 
INT3 | dB user interrupt #3 


ta [ e [| 8 | 6 | wserinterupt a4 
NTS [10 | A | 7 | aserinterupt #5 
Te [8 | aserinterupt a6 


user interrupt #7 
user interrupt #8 
user interrupt #9 
user interrupt #10 
user interrupt #11 
user interrupt #12 
user interrupt #13 
user interrupt #14 
user interrupt #15 
user interrupt #16 
TRAP instruction vector 


nonmaskable interrupt 


Tt The interrupt numbers here do not correspond to any specific TMS320C5x device. 
_ The definitions of the interrupts, specific to particular TMS320C5x devices, are cov- 
ered in Chapter 5. 


Figure 3-17. Interrupt Vector Address Generation 


IPTR = 00001 INT =5 
~N 


“XN 

Vector filo o o} |o 0 0. 

Bit 15 14 13 12 10 9 8 
Upon reset, the IPTR bits are all set to zero, thus mapping the vectors to page 
zero in program memory space. This means the reset vector always resides 
at zero. The interrupt vectors can be moved to another location by loading a 


nonzero value into the IPTR bits. For example, the interrupt vectors can be 
moved to start at location 0800h by loading the IPTR with 1. 


fan] i 
| 7 


\ 


When an interrupt occurs, a flag is activated in the 16-bit interrupt flag register 
(IFR). Each interrupt is stored in the IFR until itis recognized by the CPU. Any 
of the following four events will clear the interrupt flag: 
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ecuting the soft vector. The following example, Example 3-3, illustrates the 
minimum latency from the time an interrupt occurs externally to the interrupt 
acknowledge (IACK). The minimum interrupt acknowledge time is defined as 
- 8 cycles: 


1) 3-cycles to externally synchronize the interrupt 
2) 1-cycle to for the interrupt to be recognized by the CPU 
3) 4 cycles to execute the INTR instruction and flush the pipeline 


On the ninth cycle, the interrupt vector is fetched and the [ACK is generated. 


Example 3-3. Minimum Interrupt Latency 


Interrupt occurs prior to 
the fetch of this ae 


Fetch Mainl 
Decode 
Read 


Execute 


Main2 
Main1l 


3.8.3 
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Main3 


Main2 
Mainl 


Interrupt written 


to IFR 

L L This instruction will be refetched after return from interrupt 
Main4 MainS Main6é Dummy Dummy Dummy Vecl Vec2 Dummy Dummy ISR1 
Main3 Main4 Main5 INTR Dummy Dummy Dummy VEC1 VEC2 DUMMY DUMMY 
Main2 Main3 Main4 Main5 INTR Dummy Dummy Dummy vecl vec2 Dummy 
Mainl Main2 Main3 Main4 Main5 INTR Dummy Dummy Dummy Vecl vec2 
Interrupt latched external INTR jammed IACK generated 
to the CPU into the pipeline here 


The maximum latency is a function of what is in the pipeline. Multicycle instruc- 
tions add additional cycles to empty the pipeline. This applies to instructions 
that are extended via wait-state insertion on memory accesses. The wait states 
required for interrupt vector accesses also affect the latency. The repeat next 
instruction N times (RPT and RPTZ) also lock out interrupts, and the repeated 
instruction completes all executions before allowing the interrupt to execute. 
This is to protect the context of the repeated instructions because when re- 
peated, the instructions run more parallel operations in the pipeline, and the 
context of these additional parallel operations cannot be saved in an ISR. The 
HOLD function takes precedence over interrupts and also can delay the inter- 
rupt trap. If an interrupt happens during an active-HOLD state, the interrupt is 
taken at the completion of the HOLD state. 


Interrupts cannot be processed between CLRC INTM and the next instruction 
in a program sequence. For example, if an interrupt occurs during an CLRC 
INTM instruction execution, the device always completes CLRC INTM as well 
as the following instruction before the pending interrupt is processed. This en- 
sures that a return (RET) can be executed before the next interrupt is pro- 
cessed—thus protecting against PC stack overflow. If the ISR is exited via a 
RETE (return from ISR with enable), the CLRC INTM is unnecessary. 


Interrupt Context Save 


When an interrupt trap is executed, certain strategic registers are saved auto- 
matically. When the return from interrupt instruction (RETE or RETI) is ex- 
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In the example, the address of the reentry point within the ISR is pushed onto 
the PC stack. The RETI instruction pops all the stacks, including the PC stack, 
and resumes execution. At the end of the ISR, a standard return is executed 
because the stack is already popped. 


Not all of the 16 core CPU interrupts are necessarily used on any given 
TMS320C5x device. The vectors for the interrupts not tied to specific external 
pins or internal peripherals can be used as software interrupts. To use the cor- 
responding interrupt vectors as software traps with full context save and re- 
store, execute the INTR instruction with the appropriate interrupt number as 
an operand. These traps are protected from other interrupts in the same way 
the ISR is protected; all interrupts are globally masked via the INTM bit. To ex- 
ecute the context restore, these trap routines must be exited via the RET! or 
RETE instruction. For example, 


INTR 15 ;Software trap to address 01Eh. 


In this example, the processor will trap to the vector relatively located at 01Eh. 


3.8.4 Nonmaskable Interrupt 
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The core of the TMS320C5x has two nonmaskable interrupts, reset and NMI. 
Reset is discussed in subsection 3.8.1 NMlis used as a soft reset. It is different 
from a standard interrupt because it is not maskable, and it does not invoke the 
automatic context save. The context save is not invoked, because itis possible 
to take the NMI even during an interrupt service routine. In addition, interrupts 
are globally disabled during an NMI instruction. The NMI is different from reset 
in that it does not affect any of the modes of the device. Note that some 
TMS320C5x devices may not make the NMI available externally. The NMI is 
also delayed by multicycle instructions and HOLD, as described in subsection 
3.8.2. The NMI trap can also be initiated via software using the NMI instruction. 
This instruction forces the PC to the NMI trap location. 
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4 Assembly Language Instructions 


4.1 Memory Addressing Modes 


The TMS320C5x instruction set provides six basic memory addressing 
modes: 


Direct addressing mode 

Indirect addressing mode 
Immediate addressing mode 
Dedicated register addressing mode 


Memory-mapped register addressing mode . 


Oodddz#ad oda 


Circular addressing mode 


Both direct and indirect addressing can be used to access data memory. Direct 
addressing concatenates seven bits of the instruction word with the nine bits 
of the data memory page pointer to form the 16-bit data memory address. Indi- 
rect addressing accesses data memory through one of eight auxiliary regis- 
ters. In immediate addressing, the data is based on a portion of the instruction 
word(s). Two types of immediate addressing modes are available: short and 
long. In short immediate addressing, an 8-/9-/13-bit operand is included in the 
instruction word. Long immediate addressing mode uses as its operand a 
16-bit word following the instruction. Dedicated register addressing refers to 
the block move instructions in which the BMAR register addresses program or 
data memory and the parallel logic unit (PLU) instructions in which operands 
are obtained from the DBMR register. Memory-mapped register addressing 
mode is used to load and store memory-mapped registers. Circular addressing 
is an additional mode of indirect addressing that automatically wraps to the be- 
ginning of a block of data when the end of the block is reached. The following 
subsections describe each addressing mode and give the opcode formats and 
some examples for each mode. 


4.1.1 Direct Addressing Mode 


In the direct memory addressing mode, the instruction contains the lower 
seven bits of the data memory address (dma). This field is concatenated with 
the nine bits of the data memory page pointer (DP) register to form the full 
16-bit data memory address. Thus, the DP register points to one of 512 possi- 
ble 128-word data memory pages, and the 7-bit address in the instruction 
points to the specific location within that data memory page. The DP register 
is loaded by using the LDP (load data memory page pointer) or the LST #0 
(load status register STO) instructions. 


4-2 Assembly Language Instructions 


The opcode of the ADD 9h,5 instruction is 25h and appears in bits 15 through 
8. The shift count of 5 appears in bits 11 through 8 of the opcode. The data 
memory address 09h appears in bits 6 through 0. 


4.1.2 Indirect Addressing Mode 


Eight auxiliary registers (ARO—AR7) provide flexible and powerful indirect ad- 
dressing on the TMS320C5x. To select a specific auxiliary register, load the 
auxiliary register pointer (ARP) with a value from 0 through 7, designating ARO 
through AR7, respectively (see Figure 4—2). 


Figure 4-2. Indirect Addressing Block Diagram 


Baia Bus 16) 


Auxiliary Registers 


(ARP = 2) 


ARAU (16) 


16-Bit Data Address 


The contents of the auxiliary registers may be operated upon by the auxiliary 
register arithmetic unit (ARAU), which implements unsigned16-bit arithmetic. 
The ARAU performs auxiliary register arithmetic operations in the decode 
phase of the pipeline. This allows the address to be generated before the de- 
code phase of the next instruction. The AR is incremented or decremented af- 
ter it is used in the current instruction. 


In indirect addressing, any location in the 64K data memory space can be ac- 
cessed via a 16-bit address contained in an auxiliary register. The LAR instruc- 
tion loads the address into the register. The auxiliary registers on the 
TMS320C5x may be modified by ADRK (add to auxiliary register short immedi- 
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forms the specified mathematical operation on the indicated auxiliary register. 
Additionally, the ARP may be loaded with a new value. All indexing operations 
are performed on the current auxiliary register in the same cycle as the original 
instruction decode phase of the pipeline. 


Indirect auxiliary register addressing allows for post-access adjustments of the 
auxiliary register pointed to by the ARP. The adjustment may be an increment 
or decrement by one or may be based upon the contents of the INDX register. 
To maintain compatibility with the TMS320C2x devices, set the NDX bit in the 
PMST register to 0. In the TMS320C2x architecture, the current auxiliary regis- 
ter can be incremented or decremented by the value in the ARO register. When 
the NDX bit is set to 0, every ARO modification or LAR write also writes the 
ARCR and INDX registers with the same value. Subsequent modifications of 
the current auxiliary registers using indexed addressing will use the INDX reg- 
ister, therefore maintaining compatibility with existing TMS320C2x code. The | 
NDxX bit is set to 0 at reset. 


Bit-reversed addressing modes on the TMS320C5x allow efficient I/O to be 
performed by the resequencing of data points in a radix-2 FFT program. The 
direction of carry propagation in the ARAU is reversed when this mode is se- 
lected, and INDX is added to/subtracted from the current auxiliary register. 
Typical use of this addressing mode requires that INDX first be set-to a value 
corresponding to one-half of the array’s size, and that AR(ARP) be set to the 
base address of the data (the first data point). 


Indirect addressing can be used with all instructions except immediate oper- 
and instructions and instructions with no operands. The indirect addressing 
format is as follows: 


15 141312111098 7 6 5 4 3 210 


Bits 15 through 8 contain the opcode, and bit 7 = 1 defines the addressing mode 
as indirect. Bits 6 through 0 contain the indirect addressing control bits. 


Bit 6 contains the increment/decrement value (IDV). The IDV bit determines 
whether the INDX register will be used to increment or decrement the current 
auxiliary register. If bit 6 = 0, an increment or decrement (if any) by one occurs 
to the current auxiliary register. If bit 6 = 1, the INDX register is added to or sub- 
tracted from the current auxiliary register as defined-by bits 5 and 4. 


Bits 5 and 4 control the arithmetic operation to be performed with AR(ARP) and 
the INDX register. When set, bit 5 indicates that an increment is to be per- 
formed. If bit 4 is set, adecrementis to be performed. Table 4—1 shows the cor- 
respondence of bit pattern and arithmetic operation. 
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sigeeansnaayy AA SHAN S SESSA NINI YR AA AUPE A AAA AAA A RRA RE RRO 


Example 1 


Example 2 


Example 3 


Example 4 


| Example 5 


Example 6 


_ Example 7 
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The CMPR (compare auxiliary register with ARCR) and TC/NTC conditions fa- 
cilitate conditional branches, calls, returns, or conditional executes according 
to comparisons between the contents of ARCR and the contents of AR(ARP). 
To maintain compatibility with the TMS320C2x devices, set the NDX bit in the 
PMST register to 0. Inthe TMS320C2x architecture, the auxiliary register com- 
pare function is performed by comparing ARO with the current auxiliary regis- 
ter. When the NDX bit is set to 0, every load to ARO loads the ARCR register 
with the same value. Subsequent compares of the current auxiliary register will 
use the ARCR register, therefore maintaining compatibility with existing 
TMS320C2x code. The NDX bit is set to 0 at reset. The auxiliary registers may 
also be used for temporary storage via the load and store auxiliary register in- 
structions, LAR and SAR, respectively, or via any instruction that can load and 
store the memory-mapped auxiliary registers. 


The following examples illustrate the indirect addressing format: 
ADD *+,8 


Add to the accumulator the contents of the data memory address defined by 
the contents of the current auxiliary register. This data is left-shifted 8 bits be- 
fore being added. The current auxiliary register is autoincremented by one. 
The instruction word is O28A0h. 


ADD *,8 


As in Example 1, but with no autoincrement; the instruction word is 02880h. 
ADD *-,8 
As in Example 1, except that the current auxiliary register is decremented by 
one; the instruction word is 02890h. 
ADD *0+,8 
As in Example 1, except that the contents of register INDX are added to the 
current auxiliary register; the instruction word is O28E0h. 
ADD *0-,8 
As in Example 1, except that the contents of register INDX are subtracted from 
the current auxiliary register; the instruction word is O28DOh. 
ADD *+,8,AR3 
As in Example 1, except that the auxiliary register pointer (ARP) is loaded with 
the value 3 for subsequent instructions; the instruction word is O28ABh. 
ADD *BRO-,8 


The contents of register INDX are subtracted from the current auxiliary regis- 
ter, with reverse carry propagation; the instruction word is O28COh. 
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Memory Addressing Modes 


The following is an example code and the instruction word format for the RPT 
instruction with long immediate addressing: 


RPT #0FFFh ;Execute the instruction following the RPT instruction 1000h times. 


15 14 13 12 11109 8 7 6 5 43 2 1 =0 
1014 4 41 17 40 14 4 0 0 0 1 0 0 


16-bit constant 


4.1.4 Dedicated Register Addressing 


Nine instructions in the TMS320C5x instruction set can use one of two spe- 
cial-purpose memory-mapped registers in the core CPU. These two registers 
are the block move address register (BMAR) and the dynamic bit manipulation 
register (DBMR). The APL, OPL, CPL, and XPL parallel logic unit (PLU) in- 
structions use the contents of the DBMR register when an immediate value is 
not specified as one of the operands. The BLDD, BLDP, and BLPD instructions 
can use the BMAR register to point at the source or destination space of a block 
move. The MADD and MADS also use the BMAR register to address an oper- 
and in program memory for a multiply-accumulate operation. 


The syntax for dedicated register addressing can be stated in one of two ways: 
1) specifying BMAR by its predefined symbol as shown below: 


BLDD BMAR,DAT100 ;DP = 0. BMAR contains the value 200h. 


The contents of data memory location 200h are copied to data memory lo- 
cation 100 on the current data page. The opcode for this instruction is 
OAC64h. 


2) excluding the immediate value from parallel logic unit instructions as 
shown below. The BMAR register is implied by the MADD and MADS in- 
struction mnemonics. 


OPL DAT10 ;DP = 6. DBMR contains the value OFFFOh. 
;Address O30Ah contains the value Olh 


The contents of data memory location 030Ah are ORed with the contents 
of DBMR. The resulting OFFF1his stored back to memory location O30Ah. 
The opcode for this instruction is 590Ah. 


4.1.5 Memory-Mapped Register Addressing 


4-10 


Memory-mapped register addressing is used for modifying the 
memory-mapped registers without affecting the current data page pointer val- 
ue. In addition, any scratch pad RAM location or data page 0 can be modified 
by using this addressing mode. Figure 4—3 illustrates how this is done by forc- 
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implement a sliding window, which contains the most recent data to be pro- 


cessed. The TMS320C5x supports two concurrent circular buffers operating 
via the auxiliary registers. The following five memory-mapped registers control 
the circular buffer operation: 

CBSR1_ —Circular Buffer One Start Register 

CBSR2 _ —Circular Buffer Two Start Register 

CBER1 -—Circular Buffer One End Register 

CBER2 -Circular Buffer Two End Register 

CBCR -- —Circular Buffer Control Register 


OoOvoo 


The 8-bit circular buffer control register enables and disables the circular buffer 
operation. The CBCR is defined as follows: 


Identifies which auxiliary register is mapped to circular buffer 1. 


Circular buffer 1, enable=1/disable=0. Set to 0 upon reset. 
Identifies which auxiliary register is mapped to circular buffer 2. 
Circular buffer 2, enable=1/disable=0. Set to 0 upon reset. 


In order to define circular buffers, the start and end addresses should first be 
loaded into the corresponding buffer registers; next, a value between the start 
and end registers for the circular buffer is loaded into an auxiliary register. The 
proper auxiliary register value is loaded, and the corresponding circular buffer 
enable bitis set in the control register. Note that the same auxiliary register can 
not be enabled for both circular buffers, or unexpected results will occur. The 
algorithm for circular buffer addressing is as follows (note that the test of the 
auxiliary register value is performed before any modifications): 


lf (ARn = CBER) and (any AR modification), 


’ Then: ARn = CBSR. 


Else: ARn = ARn + step. 


In addition, note that if ARn=CBER and no AR modification occurs, the current 
AR is not modified and is still equal to CBER.Note that when the current auxilia- 
ry register = CBER, any AR modification (increment or decrement) will set the 
current AR = CBSR. The following examples illustrate the operation: 


splk #200h,CBSR1 ; Circular buffer start register 
splk #203h,CBER1 ; Circular buffer end register 
splk #0eh,CBCR ; Enable AR6 pointing to buffer 1 


- 


lar ar6,#200h ; Case 1 
lacc * ; AR6 = 200h 
lar ar6, #203h ; Case 2 
lacc * 3; AR6 = 203h 
lar ar6, #200h ; Case 3 
lacc x+ ; AR6 = 201h 
lar ar6, #203h ; Case 4 
lace x+ 3; AR6 = 200h 
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4.2 Instruction Set 


The TMS320C5x assembly language instruction set supports both DSP-spe- 
cific and general-purpose applications. This section lists and groups the 
TMS320C5x instruction set according to the following functional headings: 


Accumulator Memory Reference Instructions 

Auxiliary Registers and Data Page Pointer Instructions 
Parallel Logic Unit Instructions 

T Register, P Register, and Multiply Instructions 
Branch Instructions 

/O and Data Memory eperatione 

Control Instructions 


OOdoove 


Section 4.1 covers the addressing modes associated with the instruction set, 
and Section 4.3 describes individual instructions in more detail. 


- 4.2.1. Symbols and Abbreviations 


Table 4-3 lists symbols and abbreviations used in the instruction set summary 
(Table 4—4) and the individual instruction descriptions (Section 4.3). 
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4.2.2 Instruction Set Summary 
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Table 4—4 is a summary of the instruction set for the TMS320C5x digital signal 
processors. This instruction set is a superset of the TMS320C1ix and 
TMS320C2x instruction sets. 


The instruction set summary is arranged according to function and is alphabet- 
ized within each functional grouping. The number of words that an instruction 
occupies in program memory is specified in column four of the table. Several 
instructions specify two values, separated by a slash mark ”/” for the number 
of words. Different forms of the instruction occupy a different number of words. 
For example, the ADD instruction occupies one word when the operand is a 
short immediate value or two words if the operand is a long immediate value. 
The number of cycles that an instruction requires to execute is in column four 
of the table. Allinstructions are assumed to be executed from internal program 
memory (RAM) and internal data dual-access memory. The cycle timings are 
for single-instruction execution, not for repeat mode. Additional information is 
presented in the Individual Instruction Descriptions in Section 4.3. The symbol 
# indicates those instructions that are new for the TMS320C5x instruction set. 


Section 4.4 includes a table that maps TMS320C2x instructions to 
TMS320C5x instructions. Note that the Texas Instruments TMS320C5x as- 
sembler will accept TMS320C2x instructions as well as TMS320C5x instruc- 
tions. 
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Table 4-4. Instruction Set Summary (Continued) 


Accumulator Memory Reference Instructions (Concluded) 


[winenonie | Deseription «Words [Cycles 
[sich | Sorenignaccwinane 


Store ACC to memory-mapped register 


1 (processor memory-mapped 
register) 

2 (peripheral memory-mapped 
registers) 


Barrel-shift ACC right 0 or 16 bits as specified by 1 
TREG1 


Barrel-shift ACC right 0 to 15 bits as specified by 
TREG1 


SFLB o# Shift ACCB and ACC left 
Shift ACC right 
SFRB # ‘Shift ACCB and ACC right 


SUB Subtract from ACC 1/2 
2 (long immediate value specified) 


1 sge 
1 
1 oge 


SUBC Conditional subtract 1 


Subtract from low ACC with sign-extension sup- 1 
pressed 


Subtract from ACC with shift specified by TREG1 


XOR Exclusive-OR with ACC 1/2 
2 (long immediate value specified) 


SUBB Subtract from ACC with borrow Ee See: ee 


Zero low ACC and load high ACC with rounding 


ZAP Zero ACC and PREG 
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DS SS SS Sa 


T Register, P Register, and Multiply Instructions 


ae ae 

ae aaa 
A 
ee 
case ali) 
ene res 
a Seals 
—— 


1 
1 
1 


1 
1 
ey 


2 (long immediate value specified) 


ZPR 
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Table 4—4. Instruction Set Summary (Continued) 


/O and Data Memory Operations 


[Description Words [eycles 
moments a move from data memory to data memory 2 (operand specified by BMAR) 
3 (operand specified by long im- 
. mediate) 


| BLOP # = | | Block move from data memory to program memory —_| move from data memory to | Block move from data memory to program memory —_| memory 


et Block move from program memory to data memory 2 (operand specified by BMAR) 
3 (operand specified by long im- 
mediate) 
DMOV Data move in data memory i a 


LMMR_~ # Load memory-mapped register 2 (processor memory-mapped 
register) 

3 (peripheral memory-mapped 

register) 


Store memory-mapped register 2 (processor memory-mapped 
register) 

3 (peripheral memory-mapped 

register) 


TaLR Tale ead 
a 
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4.3 Individual Instruction Descriptions 


This section furnishes detailed information on the instruction set for the 
TMS320C5x family; see Table 4—4, Instruction Set Summary, for a complete 
list of available instructions. Each instruction presents the following informa- 
tion: 

assembler syntax 

operands 

opcode 

execution 

description 

words 

cycles 

examples 


HOO 


The EXAMPLE instruction is provided to familiarize the user with the instruc- 
tion format and explain the contents of the instruction manual pages. 
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EXAMPLE Example Instruction 


aihiiinthiensnsenatouieanennsmumnmntnspnaennatse SS Sag OS 


CREE SSN ANN RRR eo eNO eT HES CN TOS PEO De OPEN Te Ne AEE TO eT NST aS AN UN USN INGTON OS oA Mtan att MERON SARA PSS MPNES ENRON EERE REMC NHGR ASR SARIS SOA GR USERS SSSA, 


data RAM. The cycle timings are for single-instruction execution, not for repeat mode. 
Note that writing or reading any of the memory-mapped peripheral registers over the pe- 
ripheral bus will add one additional cycle to the execution of that instruction. 
Example 


Example code is included for each instruction. The effect of the code on memory and/or 
registers is summarized. 
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ADCB Add ACCB t to 9 Accumulator With Carry 


op a esa en og na Sa RNAS AAA NAS NU la A cl AR Aaa 


Syntax [label ADCB 
Operands’ None 
Opcode 
15 14 13 i2 11:10 9 8 7 6 5 4 3 
1 014 4 4 4 4 0 0 0 0 141 0 0 0 1 
Execution (PC) +1 —PC 
(ACC) + (ACCB) + (C) — ACC 
Affected by OVM; affects OV and C 


Description Thecontents of the accumulator buffer (ACCB) and the value of the carry bit (C) are added 
to the accumulator. The carry bit is set to one if the result of the addition generates a carry 
from the MSB position of the accumulator. 


Words 1 
Cycles 1 


Example ADCB 
Before Instruction After Instruction 


acc [4] 1234h acc [0] 1237h 
Cc Cc 
accep = [AB 
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seshecnaeisiensenseiete 


OSE: 


ADD Add to Accumulato 


Description 


Words 
Cycles 


Example 1 


Example 2 


Example 3 


Example 4 
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SIS SRL SEN. RESTLESS EES SENATE NON SENN CN A NSE SNE NR HS NN A I OSLER NSS A SAAMI Sah eo ean ean a ete ee Saree ale ena eh see tees 


(PC) +2 —- PC 
(ACC) + Ik x 2shift2 _, ACC 
Affected by SXM and OVM; affects C and OV. — 


The contents of the addressed data memory location or an immediate constant are left- 
shifted and added to the accumulator. During shifting, low-order bits are zero-filled. 
High-order bits are sign-extended if SXM = 1 and zero-filled if SXM=0. The result is stored 
in the accumulator. When short immediate addressing is used, the addition is unaffected 
by SXM andis not repeatable. Note that when the ARP is updated during indirect address- 
ing, a shift operand must be specified. If no shift is desired, a 0 may be entered for this 
operand. | 


When adding with a shift of 16, the carry bit is set if the results of the addition generates 
a carry; otherwise, the carry bit is unaffected. This allows the accumulation to generate 
the proper single carry when adding a 32-bit number to the accumulator. 


1. (Direct, indirect, or short immediate addressing) 
2 (Long immediate addressing) 


1 (Direct, indirect, or short immediate addressing) 
2 (Long immediate addressing) 
ADD DAT1,1 ; (DP = 6) 
Before Instruction After Instruction 
Data Memory Data Memory 


301h See 301h 
ACC acc [0] 
Cc Cc 


ADD *+,0,AR0 : 
Before Instruction After Instruction 


AR4 AR4 
Data Memory Data Memory 
302h Cah 302h 


ACC a eh acc [0] L___ 04h 
Cc Cc 


ADD #1h ;Add short immediate 
Before instruction After Instruction 


acc X] C} vce (] (rH 
C Cc | 


ADD #1111h,1 ;Add long immediate with shift of 1 
Before Instruction After Instruction 


ace acc [6] [2224] 
Cc Cc ; 
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fit 


ADDC. Add to Accumulator With Carry 


Syntax 
Operands 


Opcode 


Execution 


Words 
Cycles 
Example 1 


Example 2 
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Direct: [labe] ADDC dma 
Indirect: [/abef ADDC {ind} [,next ARP 


0 <dmas 127 
0 < next ARP < 7 


15 14 13 12 11 #10 Q 7 6 5 4 3 2 1 =O 


8 
Directs |. 0. 4A 0 0 90. 0.0/0 Data Memory Address 
4 


15 14 13 12 11 109 8 
Indirect: | O 1 1 0 0 0 0 0 


6 5 4 3 2 i 90 


See Subsection 4.1.2 


(PC)+1 — PC 
(ACC) + (dma) + (C) > ACC 


Affected by OVM; affects OV and C. Not affected by SXM. 


The contents of the addressed data memory location and the value of the carry bit are add- 


edto the accumulator with sign extension suppressed. The Carry bit is then affected in the 
normal manner. 


‘The ADDC instruction can be used in performing multiple-precision arithmetic. 
1 | 


1 
ADDC DATO 7; (DP = 6) 
Before Instruction After Instruction 
Data Memory Data Memory 
300h 300h 


ACC acc = [o] [____ 78h 
Cc Cc 


ADDC. *-,AR4 ; (OVM = 0) 
Before Instruction After Instruction 
ARP aaa) ARP aaa 
ARO ARO 
Data Memory Data Memory 

300h 300h 
ACC ACC 

Cc Cc 

[0] 

OV OV 
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Syntax 
Operands 


Opcode 


Execution 


Description 


Words 
Cycles 
Example 1 
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Direct: [label ADDT dma 
Indirect: [label ADDT {ina} [,next ARP 


0<dma < 127 
0 < next ARP <7 


15 14 13 12 11109 8 7 6 5 4 38 2 14 0 


Direct: | 0 1 1 0 0 0 1 7 Data Memory Address 


4 8 1 : 8 7 
(PC) +1 — PC 
(ACC) + [(dma) x 2TREG1(3-0)} -, (ACC) 
lf SXM = 1: 
Then (dma) is sign-extended. 
If SXM = 0: 


Then (dma) is not sign-extended. 


Affected by SXM and OVM; affects OV and C. 


The data memory value is left-shifted and added to the accumulator, with the result replac- 
ing the accumulator contents. The left-shift is defined by the four LSBs of the TREG1, re- 
sulting in shift options from 0 to 15 bits. Sign extension on the data memory value is con- 
trolled by SXM. The carry bit is set when a carry is generated out of the MSB of the accu- 
mulator. 


Software compatibility with the TMS320C25 can be maintained by setting the TRM bit of 
the PMST status register to zero. This causes any TMS320C25 instruction that loads 
TREGO to write to all three TREGs. Subsequent calls to the ADDT instruction will shift the 
value by the TREG1 value (which is the same as TREGO), maintaining object-code com- 
patibility. 


1 
1 


ADDT DAT127 ; (DP = 4. SXM = 0) 


Before Instruction After Instruction 

Data Memory Data Memory 
027Fh 027Fh 
TREG1 OFF94h TREG1 OFF94h 


ACC OF715h] acc [0] OF7A5h 
Cc Cc 
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Reina RRR TTI WORT LA ti tit eet I ROD OPELIS DEE PIT ELLE E LITA ADI COE ELOL DUD TIL TITLED OLELTEAEEELELTDEOULLEISE 8 EDTPETIIAIEIL ETI DOLE TPIT SAUE IDI OET EDL SID IRELA ZERIT AOI ETILTE ASAD ULE D EET LOL A PAN ECEPEOOLER RO IRO  E 


Syntax [/abell ADRK #k 
Operands 0<k<255 
Opcode 
15 14 138 12 11:10 9 8 7 6 5 4 3 2 1 QO 


Short: | 0 1 1 1 1 0 oO OQ 8-Bit Constant 


Execution (PC)+1 — PC 
AR(ARP) + 8-bit positive constant + AR(ARP) 


Description The 8-bit immediate value is added, right-justified, to the currently selected auxiliary regis- 
ter (as specified by the current ARP) with the result replacing the auxiliary register con- 
tents. The addition takes place in the ARAU, with the immediate value treated as an 8-bit 
positive integer. Note that all arithmetic operations on the auxiliary registers are unsigned. 


Words 1 


Cycles 1 
Example ADRK  #80h . 
Before Instruction After Instruction 
ARP 5] ARP 7 eee 
ARS ARS 43A1h 
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AND _ AND With Accumulator 


RNS 


Example 1 AND DAT16 


Data Memory 
0210h 


ACC 


Example2 AND * 


ARP 
ARO 


Data Memory 
0301h 


ACC 


Example3 AND 


% 


#00FFh, 4 


ACC 
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7 (DP = 4) 
Before Instruction 


OOFFh 
12345678h 


Before Instruction 


Before Instruction 


12345678h 


Data Memory 
0210h 


ACC 


ARP 
ARO 


Data Memory 
0301h 


ACC 


ACC 


SANSA SANS SL EEE SE EEN ELE RL SLL IEEE LENE LE LL ESTELLE SELES EES ALLL SESE IED LESS SELLA SEES 


After Instruction 


OOFFh 
00000078h 


After Instruction 


0301 


OFFOO 
00005600 


After Instruction 


00000670h 
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APAC Add P Register 


Syntax 
Operands 


Opcode 


Execution 


Description 


Words 
Cycles 


Example 
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[label] APAC 


None 
15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 #Q 
1 0 14 4 4 41 14 0 0 0 0 0 0 4 0 90 
(PC)+1— PC 


(ACC) + (shifted P register) —- ACC 


Affected by PM and OVM; affects OV and C. 
Not affected by SXM. 


The contents of the P register are shifted as defined by the PM status bits and added to 
the contents of the accumulator. The result is placed in the accumulator. APAC is not af- 
fected by the SXM bit of the status register; the P register is always sign-extended. The 
APAC instruction is a subset of the LTA, LTD, MAC, MACD, MADS, MADD, MPYA, and 
SQRA instructions. 


1 
1 


APAC 7 (PM = 01) 


Before Instruction After Instruction 

P P 

acc [x] acc [p] 
Cc Cc 
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APL AND Data Memory Value With DBMR or Long Constant 


Stas ebt abn stat att settee SRR tet iT tp Re ATS tet te ORT RTE SEN NCEE IDI IIE IIE TETAS GUAM TEAR REAL TENA EEE PERE TP EAI II ESA AIRS SAINI CEE SCRERAIEES 


Example2 aPL  DAT96_-; (DP = 0) 


Before Instruction After Instruction 
DBMR OFFOOh DBMR OFFOOh 
Data Memory Data Memory 


60h ch [O] TTOOh 
TC TC 


Example3 APL #0100h, *, AR6 


Before Instruction After Instruction 
ARP aaa: ARP [0] [___ 
TC TC 
ARS ARS 
Data Memory Data Memory 
300h 300h 
Example4 APL *, BRT . 
Before Instruction After Instruction 
ARP Ld ARP [0] 
TC TC 
AR6 AR6 
DBMR DBMR 
Data Memory Data Memory . 
310h 310h 
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BACC Bra 


ae ee cn ee 


Syntax 
Operands 


Opcode 


Execution 


Description 


Words 
Cycles 
Example 1 


Example 2 


4-44 


nch to Location Specified by Accumulator 
POSS OSE LOT LOD LT LOT LETS LIS IS LISA A LEE AT ASAT EDEL ENED IESE 


EEE IE ILO ET ELL GOEL ES EL SELES TELESIS ERLISTE ESS LIDS 


[label] BACC[D] 


None 
BACC 
15 14 13 12 11109 8 7 6 5§ 4 3 2 1 += 0 
1 0 4 #14 4 4 4+ OO 0 OO 41 0 0 0 0 0 
BACCD © 


15 14 13 12 141 10 9 8 7 6 5 4 3 2 1 0 
1 0 1 1 1 1 1 0 0 OQ 1 0 0 Od QO 1 
ACC(15-0) + PC 


Control is passed to the 16-bit address residing in the lower half of the accumulator. The 
one two-word instruction or two one-word instructions following the branch instruction are 
fetched from program memory and executed before the branch is taken, if the branch is 
a delayed branch (specified by the “D” suffix). 


1 
4 2 (If delayed) 


BACC ; (ACC contains the value 191) 


191 is loaded into the program counter, and the program continues executing from that 
location. 


BACCD ; (ACC contains the value 191) 
MAR *+,AR1 
LDP #5 


After the current AR, ARP, and DP are modified as specified, program execution continues 
from location 191. 
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The program counter (PC) is incremented by 2, and execution continues from that loca- 
tion. 


Example2 BANZD PGMO 
LACC #01h 


LDP #5 
Before Instruction After Instruction 
ARP Cd ARP | 
ARO Cr] Aro 
DP DP () 
ACC ACC 


After the current DP and ACC are modified as specified, program execution continues 
from location 0. 
Example 3 MAR *,ARO 


LAR ARI, #3 
LAR ARO, #60h 
PGM191 ADD *+,AR1 
_ BANZ PGM191,AR0 


The contents of data memory locations 60h-63h are added to the accumulator. 
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Example 1 


Example 2 


4-48 


BCND PGM191,LEQ,C 


If the accumulator contents are less than or equal to zero and the carry bit is set, program 
address 191 is loaded into the program counter, and the program continues executing . 
from that location. If these conditions do not hold, execution continues from location PC 
+2. 


BCNDD PGM191,0V 
MAR *,AR1 
LDP #5 


After the current AR, ARP, and DP are modified as specified, program execution continues 
at location 191 if the overflow flag (OV) in status register STO is set. If the flag is not set, 
execution continues at the instruction following the LDP instruction. 
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BIT Test Bit 


Example1 BIT Oh,15 ; (DP = 6).Test LSB at 300h 


Before Instruction After Instruction 
Data Memory Data Memory 

300h 300h 

TC aT) TC aaa) 

Example2 BIT *,0,AR1 ;Test MSB at 310h 

Before Instruction After Instruction 

ARP ast ae 0) ARP haa 

ARO ARO 
Data Memory Data Memory 

310h 310h 

TC ae) TC 


4-50 _ Assembly Language Instructions 


BITT__ Test Bit Specified by TREG2_ 


Words 1 
Cycles 1 
Example? BITT 00h 7 (DP = 6). Test bit 14 of data at 300h 
Before Instruction After Instruction 
Data Memory Data Memory 

300h 4DC8h 300h 4DC8h 

TREG2 TREG2 

TC eee) TC 


Example2 BITT * ;Test bit 1 of data at 310h 


Before Instruction After Instruction 

ARP Tae) ARP 

ARI ARI 
Data Memory Data Memory 

310h 310h 

TREG2 TREG2 OEh 

TC aE) TC [ 
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BLEDD Block Move From Data Memory to Data Memory nu 


Execution 


Description 


4-54 


Block move data to data with DEST in BMAR 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 #0 
Direct} 1 0 1 0 1 #14 #O 1{[0) Data Memory Address 
16 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
Indirect!} 1 0 1 O 1 1 #0 1[ 1) See Subsection 4.1.2 
(PFC) + MCS 
If long immediate: 
(PC) +2 — PC 
#ik — PFC 
Else: 
(PC) +1 = PC 


(BMAR) > PFC 


While (repeat counter) + 0: 
(src, addressed by PFC) - dst or src — (dst, addressed by PFC) 
Modify AR(ARP) and ARP as Specied: 
(PFC) + 1— PFC 
(repeat counter) —1 — repeat counter. 
(src, addressed by PFC) -dst or src — (dst, addressed by PFC) 
Modify AR(ARP) and ARP as specified. 
(MCS) > PFC 


The word in data memory pointed at by srcis copied to a data memory space pointed at 
by dst. The word of the source and/or destination space can be pointed at with along im- 
mediate value, with the contents of the BMAR register, or by adata memory address. Note 
that not all src/dst combinations of pointer types are valid. 


RPT can be used with the BLDD instruction in indirect addressing mode to move consecu- 
tive words in data memory. The number of words to be moved is one greater than the num- 
ber contained in the repeat counter RPTC at the beginning of the instruction. The source 
or destination address for the BLDD instruction specified by the long immediate address 
or BMAR register contents are automatically incremented in repeat mode. If a direct 
memory address is specified, its address is not automatically incremented in repeat mode. 
Note that the source and destination blocks do not have to be entirely on-chip or off-chip. 
Interrupts are inhibited during a BLDD operation used with the RPT instruction. When 
used with RPT, BLDD becomes asingle-cycle instruction once the RPT pipeline is started. 
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Example 5 RPtTK 
BLDD 
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2 
#300h, *+ 


ARP 
ARO 
300h 
301h 
302h 
320h 
321h 
322h 


Before Instruction 


7F98 
OFFE6 


a 


ARP 
ARO 
300h 
301h 
302h 
320h 
321h 
322h 


After Instruction 


7F98 
OFFE6 

9522 

7F98 
OFFE6h 


sa 
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Memory 


Example2 BLDP- *,ARO 


Before Instruction After Instruction 
ARP ARP fire 0 
AR7 AR7 
Data Memory Data Memory 
310h OFOFOh 310h OFOFOh 
BMAR 2800h BMAR 2800h 
Program Memory Program Memory 

2800h 1234h 2800h OFOFOh 
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BLPD Block Move From Program Memory to Data Memory 


Words 
Cycles 


Example 1 


Example 2 


or the contents of the BMAR register. The data memory destination space is always 
pointed at by a data memory address or auxiliary register pointer. Note that not all src/dst 
combinations of pointer types are valid. 


RPT can be used with the BLPD instruction if more than one word is to be moved. The 
number of words to be moved is one greater than the number contained in the repeat 
counter, RPTC, at the beginning of the instruction. The source address specified by the 
long immediate or BMAR value is automatically incemented in repeat mode. Note that the 
source and destination blocks do not have to be entirely on-chip or off-chip. Interrupts are 
inhibited during a repeated BLPD instruction. When used with RPT, BLPD becomes a 
single-cycle instruction once the RPT pipeline is started. 


1 (Source is specified by the BMAR register) 
2 (Source is specified by a long immediate) 


2 (Source is specified by the BMAR register) 
3 (Source is specified by a long immediate) 


BLPD #800h, 00h ; (DP=6) 


Before Instruction After Instruction 
Program Memory Program Memory 
800h OFh 800h 
Data Memory Data Memory 
300h 300h OFh 
BLPD #800h, *,AR7 
Before Instruction After instruction 
ARP [Ld ARP 
ARO ARO 310h 
Program Memory Program Memory 
800h 1111h 800h 1111h 
Data Memory Data Memory 
310h 310h 1111h 
Example3  BLPD  BMAR, 00h ; (DP=6) . 
Before Instruction After Instruction 
BMAR 800h BMAR 800h 
Program Memory Program Memory 
800h OFh 800h 
Data Memory Data Memory 
300h 300h OFh 


4-60 


Syntax [label BSAR shift 
Operands 1<shift< 16 


Opcode 


15 14 13 #12 #11 


1 


oO 1 1 #1 


T See Section 4.5. 


Execution (PC) +1 — PC 


(ACC) / 2shift + ACC 


Affected by SXM. 


Description The BSAR instruction executes a 1- to 16-bit right-barrel arithmetic shift of the accumula- 
tor in a single cycle. The sign extension is determined by the sign-extension mode bit in 


status register 1 
Words 1 
Cycles 1 


Example 1 BSAR 16 


ACC 


Example2 BSAR 4 


ACC 


4-62 


(ST1). 


3; (SXM=0) 
Before Instruction 


00010000h 


3 (SXM=1) 
Before Instruction 


OFFF10000h 


ACC 


ACC 


After Instruction 


00000001h 


After Instruction 


OFFFF1000h 


Assembly Language Instructions 


CALL Call Unconditionally 


Syntax 
Operands 


Opcode 


Execution 


Description 


Words 
Cycles 
Example 1 


4-64 


[label] CALL[D] pma [,{ina} [,next ARP] 


0<pma s 65535 
O< next ARP <7 


CALL 


15 14 13 12 1110 9 8 7 6 5 4 3 2 14.0 
014% %4 1 0 14 0 See Subsection 4.1.2 
16-Bit Constant 


CALLD 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 jj QO 
0 1 1 1 1 1 1 0 See Subsection 4.1.2 
16-Bit Constant 
Non-delayed: PC +2 — TOS 

Delayed: PC+4-— TOS 


pma — PC 
Modify AR(ARP) and ARP as specified. 


The current program counter (PC) is incremented and pushed onto the top of the stack 
(TOS). Then, the contents of the program memory address (pma), either a symbolic or 
numeric address, are loaded into the PC. Execution continues at this address. The current 
auxiliary register and ARP are modified as specified. If the call is a delayed call (specified 
by the “D” suffix), the one two-word instruction or two one-word instructions following the 
call instruction are fetched from program memory and executed before the call is ex- 
ecuted. 


2 A 
4 2(If delayed) 
CALL PRG191,*+,AR0 


Before Instruction After Instruction 
ARP ARP eT) 
ARt ARt 
PC PC 
TOs Tos [32H] 


OBFh is loaded into the program counter, and the program continues executing from that 
location. 


Assembly Language Instructions 


cc Call Conditionally 


Syntax [label] CC[D] pma [cond] [,condZ |... 
Operands 0<pma< 65535 


Conditions: ACC=0 EQ 
ACCz0 NEQ 
ACC<0 LT 
ACCs0 . LEQ 
ACC>0 GT 
ACC20 ; GEQ 
C=0 NC 
C=1 C 
OV=0 NOV 
OV=1 OV 
TC=0 NTC 
TC=1- TC 
BIO low BIO 
Unconditionally UNC 

Opcode 
CC 


15 14 13 12 11:10 9 8 7 6 5 4 3 2 741 9 


1 41 4 0 4 of Tet |  zvet ZLVC t 
16-Bit Constant 


15.14, 13 12 ~«11:+«10 9 8 7 6 5 4 3 2 i 0 


1-4 he WO et. . ver ZLvc t 
16-Bit Constant 


t See Section 4.5. 


Execution _ \|f(condition(s)) 
Then 
Nondelayed: PC +2 ~ TOS 
Delayed: PC +4 — TOS 
pma — PC 
Else 
PC +2 — PC 


Description Controlis passed to the program memory address pma ifthe specified conditions are met. 
Note that not all combinations of conditions are meaningful. In addition, the NTC, TC, and 
BlO conditions are mutually exclusive. If the call is a delayed call (specified by the “D” suf- 
fix), the two one-word instructions or the one two-word instruction following the call are 
fetched from program memory and executed before the call is executed. The CC instruc- 
tion operates like the CALL instruction if all conditions are true. 


Words 2 


CCD 


4-66 Assembly Language Instructions 


Syntax [!abel] CLRC control bit 
Operands Control bit: STO, ST1 bit (from the following set): 
| {C, CNF, HM, INTM, OVM, TC, SXM, XF} 


Opcode 
Reset overflow mode (OVM) 


15 14°13 #12 +11°10 9 8 7 6 5 4 #3 2 
1 O 14 4 4 4 41 0 0 4 0 0 0 0 1 =O 


Reset sign extension mode (SXM) 


15 14 13 + #12 11°10 9 8 7 6 5 4 3 2 i 0 
1 0 1 14 14 1 1 0 0 41 0 0 0 14 1 =O 


Reset hold mode (HM) 


15 14 13 12 11 #10 9 8 7 6 5 4 3 2 74 OQ 

1 oO 1 1 1 1 10 0 4 0 0 4 0 0 0 
Reset TC bit 

15 14 13 12 11 #10 9 8 vA 6 5 4 3 2 1 0 

1 0 1 1 1 1 1 0 0 4 0 0 4 0 +4 =O 
Reset carry (C) 

15 14 13 12 11 10 9 8 7 6 § 4 3 2 141 QO 

1 O 1 1 1 1 1 0 0 4 0 0 4 1 1 #O 
Reset CNF bit 


15 14°13 #12 #W1:°#10 9 8 7 6 5 4 3 2 ji 0 
1 OO 14 4 4 4 41 0 0 41 0 0 0 141 0 40 


Reset INTM bit 


1614 13 12 11 10 9 8 7 6 5 4 3 2 i 0 
1 O 4 1 4 4 4 0 0.1 0 0 0 0 0 0 


Reset XF pin 
15 14 13 #12 11 #10 9 8 7 6 5 4 3 2 _ | 0 
1 0 14 14 14 4 4 OO 8O 141 0 0 4 4 0 90 
Execution (PC)+1 — PC 
0 — control bit 


Description The specified control bit is set to a logic zero. Note that the LST instruction may also be 
used to load STO and ST1. See subsection 3.6.3, Status and Control Registers, for more 
information on each of these control bits. 


Words 1 
Cycles 1 


4-68 Assembly Language Instructions 


Syntax [label] CMPL 
Operands None 


Opcode 
15 14 13 °#12 11°10 9 8 7 6 5 4 3 2 i @Q 
1 0 74 14 4 4 4 0 0 0 0 0 0 0 0 +4 


Execution (PC)+1— PC 
(ACC) — ACC 


Description The contents of the accumulator are replaced with its logical inversion (ones comple- 
ment). The carry bit is unaffected. 


Words 1 
Cycles 1 
Example CMPL 
Before Instruction After Instruction 
ACC 0F7982513 ACC 0867DAECh 
Cc Cc 


4-70 Assembly Language Instructions 


CPL Compare DBMR or Long Immediate With Data Value 


Syntax 


Operands 


Opcode 


Execution 


Description 


Words 


4-72 


Direct: [label] CPL [,#/k] dma 
Indirect: [label] CPL [,#/k] {ind} [,next ARP] 


0<dma< 127 
Ik: 16-bit constant 
0 < next ARP <7 
Compare DBMR to data value 
15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 #20 


Direct! |} 0 1 O 41 1 O 14 #1/04 Data Memory Address 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
Indirect:} 0 1 0 141 #141 #0 1 1/1/14 See Subsection 4.1.2 


Compare data with long immediate 


15 14113 «12 611210 9 8G DCU BU 


‘ 0 10 4 4 #4 #4 1 Data Memory Address 
Direct: 
16-Bit Constant 


1514 13 12 ~«11:~210~ 29 


8 7 § § 4 3 2 jj Q 
0 1 0 414 4 1 #71 #41 See Subsection 4.1.2 
ate tO A AG a 
16-Bit Constant 


Ik unspecified: 
(PC) +1 — PC 


Compare DBMR contents to (dma). 
If (DBMR) = (dma), 
T . 


Else, 
TC =0. 


Ik specified: 
(PC) +2 — PC 


Compare Ik to (dma). 
If Ik = (dma), 

TC =1; 
Else 

TC =0. 


Affects TC. 
Not affected by SXM. 


If the two quantities involved in the comparison are equal, the TC bit is set to one. TC is 
set to zero otherwise. 


1 (Iflong immediate value is not specified) 


2 ___ (If long immediate value is specified) 


Assembly Language Instructions 


CRGT _ Test for ACC>ACCB 


Syntax 
Operands 


Opcode 


Execution 


Description 


Words 
Cycles 
Example 1 


Example 2 


4-74 


[label CRGT 


None 

15 14 13 12 11109 8 7 6 5 2 

1 0 %14 4 14 14 4 0 0 0 0 4 4 0 +4 41 
(PC) +1 — PC 


If (ACC) > (ACCB) 

Then (ACC) — ACCB;1— C 
If (ACC) < (ACCB) 

Then (ACCB) —- ACC;0—- C 
If (ACC) = (ACCB) 

Then i- C 


Affects C. 


The contents of the accumulator (ACC) are compared to the contents of the accumulator 
buffer (ACCB). The larger value (signed) is loaded into both registers. If the contents of 
the accumulator are greater than or equal to the contents of the accumulator buffer, the 
carry bit is set to 1. Otherwise, it is set to 0. 


1 
1 
CRGT 
Before Instruction After Instruction 
ACCB ACCB 
ACC (ees bi] ACC CO 5h] 
C eT) C ee 
CRGT 
Before Instruction After Instruction 


ACCB Ch] ACCB 
ACC a) ACC 
C CtC~“‘itd C 


et ee 


Assembly Language Instructions 


Syntax 


Operands 


Opcode 


Execution 


Description 


Words 
Cycles 
Example 1 


Example 2 


4-76 


Direct: [label DMOV dma 
Indirect: [label DMOV {ind} [,next ARP] 


O0<sdmas 127 
0 < next ARP <7 


15 14 13 12 11:10 9 8 7 


6 5 4 3 2 i @Q 
Direct! |] 0 1 14 1 0 1 14 #1107 Data Memory Address 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 =O 
Indirect: [07 1 1 0 1 1 4] 17]  SeeSubsection412 | 
(PC)+1— PC 
(dma) > dma+1 
Affected by CNF and OVLY. 


The contents of the specified data memory address are copied into the contents of the next 
higher address. DMOV works only within on-chip data RAM blocks. It works within any 
configurable RAM block if that block is configured as data memory. In addition, the data 
move function is continuous across block boundaries. The data move function cannot be 
used on external data memory or memory-mapped registers. If used on external memory 
or memory-mapped registers, DMOV will read the specified memory location but will per- 
form no operations. 


When data is copied from the addressed location to the next higher location, the contents 
of the addressed location remain unaltered. 


The data move function is useful in implementing the z—! delay encountered in digital sig- 
nal-processing. The DMOV function is included in the LTD, MACD, and MADD instruc- 
tions (see the LTD, MACD, and MADD instructions for more information). 


1 
1 


DMOV DAT8 7 (DP = 6) 
Before Instruction After Instruction 
Data Memory Data Memory 
308h 308h 
Data Memory Data Memory 
309h nr) 309h 
DMOV *, ARI 
Before Instruction After Instruction 
ARP aaa) ARP eee 
AR1 30Ah AR1 30Ah 
Data Memory Data Memory 
30Ah 30Ah 
Data Memory Data Memory 
30Bh 30Bh 


Assembly Language Instructions 


Syntax 
Operands 
Opcode 


Execution 


Description 


Words 
Cycles 


Example 


4-78 


ULL LLL ee ee ee ER CET 


[label] IDLE 


None 
15 14 13 12 11109 8 7 6 5 4 38 2 1 =#@Q 
1 014 4 4 4 4 0 0 0 4 0 0 0 0 
(PC) +1 — PC 


Affected by INTM. 


The IDLE instruction forces the program being executed to wait until an unmasked inter- 
rupt (external or internal) or reset occurs. The PC is incremented only once, and the device 
remains in an idle state until interrupted. 


The idle state is exited by an unmasked interrupt even if INTM is 1. In the case of INTM 
being 1, the program will continue executing at the instruction following the IDLE. If INTM 


is 0, then the program will branch to the corresponding interrupt service routine. Execution 


of the IDLE instruction causes the TMS320C5x to enter the power-down mode. During 
the idle mode, the timer and serial port peripherals are still active. Therefore, timer and 
peripheral interrupts, as well as reset or external interrupts, will remove the processor from 
the idle mode. 


1 


IDLE ;The processor idles until a reset or unmasked interrupt 
;occurs. 


Assembly Language Instructions 


IN _/nput Data From Port 


Syntax 


Operands 


Opcode 


Execution 


Description 


Words 
Cycles 


Example 1 


Example 2 


4-80 


Direct: [label] IN dma, PA 
Indirect: [label] IN {ind} ,PA[,next ARP] 


O0<dma< 127 
0 < next ARP <7 
O< PA < 65535 


15 14 (13 12 ~11+~210 = =9 


ie oa Sneea adie 
16-Bit Constant 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 j{ QO 
(PC) +2 — PC 
While (repeat counter) # 0 
Port address — address bus A15—A0 
Data bus D15—DO — dma 
Port address + 1 — Port address 
(repeat counter — 1) — repeat counter 


Indirect: 


The IN instruction reads a 16-bit value from an external I/O port into the specified data 
memory location. The IS line goes low to indicate an I/O access, and the STRB, RD, and 
READY timings are the same as for an external data memory read. Note that port address- 
es 50h-—5Fh are memory-mapped (see subsection 5.1.1), but the other port addresses are 
not. 


RPT can be used with the IN instruction to read in consecutive words from I/O space to 
data space. In the repeat mode, the port address (PA) is incremented after each access. 


2 


2 (Each access cycle time increases by i, the number of I/O memory wait states. This 
is the number of cycles the device must wait for external I/O memory accesses.) 


IN DAT7, PAS ;Read in word from peripheral 
70n port address 5. Store in 
;data memory location 307h (DP=6). 


IN *, PAO ;Read in word from peripheral on 
7port address 0. Store in data memory 
;location specified by the current 
yauxiliary register. 


Assembly Language Instructions 


INTR __ Soft interrupt _ 


[tater [Location 
TRAP 


[20 
ea 
[2a 
2 
[serceined [2a 
[~serened [2a 
[veered | 20h 
[werdned [2m 
[erred [20h 
[—serdetined | een 
[_ertined [24h 
[vserdtined | 36h 
[erties [38h 
[vsecatined | oan | 
[—sertined | 30h 
[—wsordetined | 38h 


Words 1 
Cycles 4 


Example . INTR 3 ;Control is passed to program memory location 6h 
7PC + 1 is pushed onto the stack. 


4-82 Assembly Language Instructions 


-ooe 


Syntax 


Operands 


Opcode 


Execution 


Description 


Words 


Cycles 


4-84 


Load Accumulator With Shift 


Direct: [labell LACC dma [,shiftT] 
Indirect: [label] LACC {ind} [,shift1 [,next ARP] 
Immediate: [labe] LACC #ik [,shift] | 


0<dma< 127 

0 <next ARP <7 
O<shifti<16 (defaults to 0) 
~—32768 < Ik < 32767 
O<shift2<15 (defaults to 0) 


15 14 13 12 11: #10 9 8 7 6 5 4 3 2 ji 0 


Direct! | 0 O O 1 SHFTt | 0 | Data Memory Address 


15.14 13 #12 11:10 9 8 7 6 5 4 3 2 +1 #0 


Indirect: fOr 04 = 0. 1 SHFTt eal See Subsection 4.1.2 
15 14 13 12 11 £10 9 8 7 6 5 4 3 2 1 0 
10% 14 4 4 4 4 4 0 0 0 SHFT t 
Long: 
16-Bit Constant 


Load ACC with shift of 16 
i5 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
Direct} 0 1 1 O08 1 0 1 Of 0] Data Memory Address 


15 14 13 12 11:10 9 8 7 6 5 4 3 2 i 0 


Indirect: | 0 1 1 on | Oo #1 0 1 See Subsection 4.1.2 


t See Section 4.5. 
Direct or Indirect Addressing: 


(PC) +1 —» PC 
(dma) x 2shift! _, ACC 


Long immediate Addressing: 
(PC) +2 — PC 
Ik x gshift2 _, ACC 


Affected by SXM. 


The contents of the specified data memory address or a 16-bit constant are left-shifted 
and loaded into the accumulator. During shifting, low-order bits are zero-filled. High-order 
bits are sign-extended if SXM = 1 and zeroed if SXM = 0. 


1 (Direct or indirect addressing) 
_ (Long immediate addressing) 


(Direct or indirect addressing) 
(Long immediate addressing) 


hP— 0% 


Assembly Language Instructions 


LACL _ Load Low Accumulator and Clear High Accumulator _ 


SR PROIS SLED RIDES T LISLE LES ESS TILL IL LAL LESTE DOL ELLIO LILLIE TILE LTE ELL LITE GILT DILL OLLIE IIIT 


Syntax Direct: [label] LACL dma 
Indirect: [label] LACL {ind} [,next ARP] 
Immediate: [label] LACL #k 

Operands Q<dmas 127 
0 <next ARP <7 > 
0<ks<255 

Opcode 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 i 90 
1 


Direct: | 0 1 14 0 14 0 0 | 0 | Data Memory Address 


15 14 13 12 ii 10 8 7 6 § 


, 9 4 3 2 1 0 
Indirect!}} 0 1 #1 0 1 0 0 171) See Subsection 4.1.2 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 i QO 


Short Immediate: | 1 0 1 1 1 0 O0O 1 8-Bit Constant 
Execution (PC)+1— PC 


Direct or Indirect Addressing: 


0 — ACC(31-16) 
(dma) — ACC(15-0) 


Short Immediate Addressing: 


0 — ACC(31-8) 
k -» ACC(7-0) 


Not affected by SXM. 


Description The contents of the addressed data memory location or a zero-extended 8-bit constant 
are loaded into the 16 low-order bits of the accumulator. The upper half of the accumulator 

_is zeroed. The data is treated as an unsigned 16-bit number rather than a twos-comple- 

ment number. There is no sign-extension of the operand with this instruction, regardless 


of the state of SXM. 
Words 1 
Cycles 1 


4-86 | Assembly Language Instructions 


Syntax 


Operands 


Opcode 


Execution 


Description 


Words 
Cycles 


4-88 


Direct: [labe] LACT dma 
Indirect: [label] LACT {ino} [,next ARP] 


0<dma<s 127 
0<next ARP <7 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 i oO 


Direct! } 0 1 #1 0 41 0 1 #71404 Data Memory Address 


514 19 12 11 10 8 7 


indirect: [01101011 [1] S00 Subsection S12 


(PC)+1— PC 
(dma) X 2TREG1(3-0) -, ACC 
If SXM = 1: . 
Then (dma) is sign-extended. 
If SXM = 0: 
Then (dma) is not sign-extended. 


Affected by SXM. 


The LACT instruction loads the accumulator with a data memory value that has been 
left-shifted. The left-shift is specified by the four LSBs of TREG1, resulting in shift options 
from 0 to 15 bits. Using TREG1’s contents as a shift code provides a dynamic shift mecha- 
nism. During shifting, the high-order bits are sign-extended if SXM = 1 and zeroed if SXM 


=0. 


LACT may be used to denormalize a floating-point number if the actual exponentis placed 
in the four LSBs of the T register and the mantissa is referenced by the data memory ad- 
dress. Note that this method of denormalization can be used only when the magnitude of 
the exponent is four bits or less. 


Software compatibility with the TMS320C25 can be maintained by setting the TRM bit of 
the PMST status register to zero. This causes any TMS320C25 instruction that loads 
TREGO to write to all three TREGs. Subsequent calls to LACT will contain the correct shift 
value in TREG1, maintaining eee compatibility. 


1 
1 


Assembly Language Instructions 


Syntax 


Operands 


Opcode 


Execution 


Description 


Words 
Cycles 


Example 1 


Example 2 


4-90 


fad Accumulator With Memory-Mapped Register 


Direct: [label] LAMM dma 
Indirect: [/abe] LAMM {ino} [,next ARP] 


O<dma< 127 
0 < next ARP < 7 


15 14 13 12 11:10 9 8 7 6 5 4 3 2 1 #0 
0 


Direct! } 0 O08 0 0 1 O 0 | 0 Data Memory Address 


16 14 13 12 11 10 9 8 7 6 5 
0 0 60 


4 3 2 1 0 
0 0 0 1 0 See Subsection 4.1.2 


Indirect: 


(PC)+1— PC 
(dma) — ACC 


The low word of the accumulator is loaded with the contents of the addressed 
memory-mapped register. The 9 MSBs of the data memory address are set to zero, re- 
gardless of the current value of DP or the upper 9 bits of AR(ARP). This instruction allows 
any location on data page zero to be loaded into the accumulator without modifying the 
DP field in status register STO. 


1 


1 (For processor memory-mapped registers) 
2 ‘(For peripheral memory-mapped registers) 


LAMM BMAR ; (DP = 6) 
Before Instruction After Instruction 
ACC ACC 
BMAR BMAR 
Data Memory Data Memory 
31Fh 31Fh 
LAMM * 
Before Instruction After Instruction 
ARP ARP 
ARI Rt 
ACC ACC 
PRD OFh PRD OFh 
Data Memory Data Memory | 
325h 325h 


Note that the value in data memory location 325h is not loaded into the accumulator. The 
value at data memory location 25h is loaded. 


Assembly Language Instructions 


LAR Load Auxiliary Register 


storage register, especially for swapping values between data memory locations without 
affecting the contents of the accumulator. 


Words 1 (Direct, indirect, or short immediate addressing) 
2 (Long immediate addressing) 


Cycles 2 
Example 1 LAR ARO,DAT16 ; (DP = 6) 
Before Instruction After Instruction 
Data Memory he Data Memory 
310h 310h 


ARO ARO 


Example2 LAR AR4,*- 


Before Instruction After Instruction 
ARP Pa a, ARP ene) 
Data Memory Data Memory 
300h [2h] 30h [ah] 
AR4 fe 300h | AR4 


Note: 


LAR in the indirect addressing mode ignores any AR modifications if the AR 

specified by the instruction is the same as that pointed to by the ARP. There- 

fore, in Example 2, AR4 is not decremented after the LAR instruction. 
ee | 


Example3 LAR AR4, #01h 


Before Instruction After Instruction 
AR4 OFFO9h AR4 
Example4 LAR AR4, #3FFFh ; 
Before Instruction After Instruction 
ARA AR SFFFH 


4-92 Assembly Language Instructions 


Example 2 LDP #0h 


DP 


Example 3 LDP *, ARS 


ARP 
AR4 


Data Memory 
300h 


DP 


4-94 


Before Instruction 
1FF 


| 
| 
a 


Before Instruction — 


1FFh 


DP 


ARP 
AR4 


Data Memory 
300h 


DP 


After Instruction 


After Instruction 


o 
= 


Assembly Language Instructions 


LMMR __ Load Memory-Mapped Register 


Example2 LMMR *,#300h,AR4 ;CBCR = 1Eh 


Before Instruction After Instruction 

ARP (Sa) ARO 

ARO LST Eh ARO 
Data Memory Data Memory 

300h | 300h 

CBCR CBCR 20h 


4-96 Assembly Language Instructions 


po choad Sits Bee 


Syntax 


Operands 


Opcode 


Execution 


Description 


Words 
Cycles 
Example 1 


4-98 


ster 


. 


Direct: [label] LST #n, dma 
Indirect: [label] LST #n, {ina} [,next ARP 


0<dmas 127 
n=0,1 
0 < next ARP < 7 


LST #0 
15 14 13 #12 11:10 9 8 7 6 5 4 3 2 ji 90 


Direct: |} 0 O O O 1 1 1 #O Data Memory Address 


15/14 13°12 11:10 9 8 7 6 5 4 3 2 1 0 
Indirect!} 0 0 0 O 1 #1 +1 «0 See Subsection 4.1.2 


LST #1 
15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 O 


Direct: | 0 O OO O 1 et eee | Data Memory Address 


15 14 13 12 11 #10 9 8 7 6 5 4 3 2 i 0 
1 


Indirect} 0 O OO O 1 1 1 See Subsection 4.1.2 


(PC) +1 — PC 
(dma) — status register STn 
dma (bits 13-15) — ARP (regardless of n) 


Affects ARB, ARP, OV, OVM, DP, CNF, TC, SXM, C, HM, XF, and PM. 
Does not affect INTM. — 


Status register STn is loaded with the addressed data memory value. Note that the INTM 


bit is unaffected by LST #0. In addition, the LST #0 instruction does not affect the ARB 


field in the ST1 register even though a new ARP is loaded. If anext ARP value is specified 
via the indirect addressing mode, the specified value is ignored. Instead, ARP is loaded 
with the value contained within the addressed data memory word. 

Se ee a a ee a ee es 


Note: 
When ST71 is loaded, the value loaded into ARB is also loaded into ARP. 
_———— ee | 


The LST instruction can be used for restoring the status registers after subroutine calls 
and interrupts. 


1 
2 


MAR *,ARO 

LST #0,*,AR1 ;The data memory word addressed by the contents of 
;auxiliary register ARO is loaded into status register STO, 
7except for the INTM bit. Note that even though a next 
7ARP value is specified, that value is ignored, and the old 
;ARP is not loaded into the ARB. 


Assembly Language Instructions 


Syntax 
Operands 


Opcode 


Execution 


Description 


Words 
Cycles 
Example 1 


Example 2 


4-100 


Direct: [label] LT dma 
Indirect: [label LT {ind} [,next ARP 


0 <dma < 127 
0 < next ARP <7 


15 14 13 12 11 10 9 8 
Direc: | 0 1 1 #1 #0 0 41 


6 5 4 3 2 i 90 


Data Memory Address . 


7 
1 [0 | 
15 14 13 12 11:10 9 8 7 6 5 4 3 2 1 0 


Indirect: | O 1 1 1 0 Oo 1 1 See Subsection 4.1.2 


(PC)+1— PC 
(dma) — TREGO 


If TRM = 0: 
(dma) > TREG1 
(dma) + TREG2 


Affected by TRM. 


TREGO is loaded with the contents of the specified data memory address (dma). The LT 
instruction may be used to load TREGO in preparation for multiplication. See the LTA, LTD, 
LTP, LTS, MPY, MPYA, MPYS,and MPYU instructions. If the TRM bit of the PMST register 
is 0, then TREG1 and TREG2 are also loaded to maintain compatibility with the 
TMS320C25. The TREGs are memory-mapped registers and may be read and written 
with any instruction that accesses data memory. Note that TREG1 is only 5 bits and 
TREG2 is only 4 bits. 


1 
1 


LT DAT24 ;(DP = 8. TRM = 1). 
Before Instruction After Instruction 
Data Memory Data Memory 
418h 418h 
TREGO CH] TREGO 
LT *,AR3 ;(TRM = 0) 
Before Instruction After Instruction 


ARP Ci‘ ARP ee) 
AR2 AR2 
Data Memory Data Memory 
4t8h 4t6h 
TREGO TREGO 
TREGI TREGI 
TREG2 [CS TREG2 62h 


Assembly Language Instructions 


LTA = Load TREGO and Accumulate Previous Product _ 


NNR NR ARIAL A TRIO IL ERE REEL AIPA PSL ESS PRISE RPE EN EDEL EDEL PLS EEE LDL PAL OLEL ED COPE LOLI LES 


Example2 LTA *,S 7; (TRM = 0) 


ARP 
AR4 


Data Memory 
324h 


TREGO 
TREG1 
TREG2 
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Before Instruction 


ARP 
AR4 


Data Memory 
324h 


TREGO 

TREG1 

TREG2 
P 


acc [o] 
Cc 


After Instruction 


Assembly Language Instructions 


AAA Se Et AEN Ne Arr ee On cece narra cae 


LTD, Load TREGO, Accumulate Previous Product, and Move Data. 


Example 2 LTD *,AR3 ;(TRM = 0) 


Before Instruction After Instruction 
ARP ARP =a) 
ARI ARI 
Data Memory Data Memory 
3FEh SFEh 
Data Memory Data Memory 
3FFh 3FFh 
TEGO «= [3h] TREGO 
TREG1 TREG1 62h 
TRec2 = sh) TREG2 
P P 
Ace C7} aces 
Cc Cc 


4-104 . Assembly Language Instructions 


LTP Load TREGO and Store P Register in Accumulator 


Example 2  LTp *,ARS ; (PM = 0, TRM = 0) 


Before Instruction After Instruction 

ARP ee ARP are 
AR2 AR2 

Data Memory Data Memory 
324h 324h 
TREGO L 3h} = REGO 
TREG! TREG! 
TREG2 [Sh] TREG2 
Pp P 
ACC 7, ACC OFh 
Cc Cc 


4-106 Assembly Language Instructions 


Syntax 


Operands 


Opcode 


Execution 


Description 


4-108 


MAC Multiply and Accumulate 


Direct: [label MAC pma, dma 
Indirect: [label] MAC pma, {ind} [,next ARP] 


0<pma < 65535 
O0<dma< 127 
0 < next ARP <7 


15 14°13 #12 11°10 9 8 7 6 5 4 3 2 71 0 


; 1 0 1 0 0 0 4 0 Data Memory Address 
Direct: 


16-Bit Constant 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 (0) 
1 0 1 0 O 0 1 0 See Subsection 4.1.2 
16-Bit Constant 

(PC) +2 — PC 


(PFC) — MCS 
(pma) — PFC 


If (repeat counter) = 0: 
Then (ACC) + (shifted P register) — ACC, 
(dma) — TREGO 
(dma) x (pma, addressed by PFC) — P register, 
Modify AR(ARP) and ARP as specified 
(PFC) + 1— PFC 
(repeat counter) — 1 — repeat counter. 
Else (ACC) + (shifted P register) — ACC, 
(dma) — TREGO 
(dma) x (pma, addressed by PFC) — P register, 
Modify AR(ARP) and ARP as specified 
(MCS) — PFC 


Affected by OVM and PM; affects C and OV. 


The MAC instruction multiplies a data memory value (specified by dma) by a program 
memory value (specified by pma). It also adds the previous product, shifted as defined 
by the PM status bits, to the accumulator. 


Indirect: 


The data and program memory locations on the TMS320C5x may be any nonreserved, 
on-chip or off-chip memory locations. If the program memory is block BO of on-chip RAM, 
then the CNF bit must be set to one. When the MAC instruction is used in the direct ad- 
dressing mode, the dma cannot be modified during repetition of the instruction. 


When the MAC instruction is repeated, the program memory address contained in the 
PFC is incremented by one during its operation. This makes it possible to access a series 
of operands in memory. MAC is useful for long sum-of-products operations because it be- 
comes a single-cycle instruction, once the RPT pipeline is started. 


Assembly Language Instructions 


Syntax 


Operands 


Opcode 


Execution 


Direct: [label MACD pma, dma - 
Indirect: [label MACD pma, {ind} [,next ARP 


0 < pma < 65535 
O<dma< 127 
0<next ARP <7 


15 14°13 ~ 12) 17'—C 10 Fo 


8 7 6 5 4 38 2 14 0 
1 0 1 0 0 O 1 1 Data Memory Address 
Pare (ec a ee ry 


16-Bit Constant 


15 14 13 12 11°10 9 8 7 6 5 4 3 2 141.0 
10% 00 01% 1 See Subsection 4.1.2 
16-Bit Constant 


(PC) +2 — PC 
(PFC) + MCS 
(pma) — PFC 


If (repeat counter) = 0: 
Then (ACC) + (shifted P register) — ACC, 
(dma) — TREGO 
(dma) x (pma, addressed by PFC) — P register 
Modify AR(ARP) and ARP as specified, 
(PFC) + 1 —» PFC 
(dma) — (dma) + 1 
(repeat counter) — 1 — repeat counter. 
Else (ACC) + (shifted P register) — ACC, 
(dma) — TREGO 
(dma) x (pma, addressed by PFC) — P register 
(dma) — (dma) + 1 
Modify AR(ARP) and ARP as specified, 
(MCS) > PFC 


Affected by OVM and PM; affects C and OV. 


Indirect: 


Description The MACD instruction multiplies a data memory value (specified by dma) by a program. 
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memory value (specified by pma). It also adds the previous product, shifted as defined 
by the PM status bits to the accumulator. The data and program memory locations on the 
TMS320C5x may be any nonreserved, on-chip or off-chip memory locations. If the pro- 
gram memory is block BO of on-chip RAM, then the CNF bit must be set to one. When 
MACDis used inthe direct addressing mode, the dmacannotbe modified during repetition 
of the instruction. If MACD addresses one of the memory-mapped registers or external 
memory as a data memory location, the effect of the instruction will be that of a MAC in- 
struction (see the DMOV instruction description). 


Assembly Language Instructions 


Syntax 


Operands 


Opcode 


Execution 


Description 


4-112 


Direct: {label MADD dma 
Indirect: [label] MADD {ing} [,next ARP 


0<dma < 127 
0 < next ARP <7 


15.14 13 #12 11:10 9 8 7 6 5 4 3 2 1 #0 
oO 1 1 


Direct: |} 1 O 1 O 1 Data Memory Address 
15 14 138 12 11 10 9 8 7 6 5 4 3 2 {f 0 
Indirect:} 1 0 1 O 1 0 4 4/14) See Subsection 4.1.2 
(PC) +2 — PC 
(PFC) — MCS 


(BMAR) — PFC 


If (repeat counter) = 0: 
Then (ACC) + (shifted P register) — ACC, 
(dma) — TREGO 
(dma) x (pma, addressed by PFC) — P register, 
Modify AR(ARP) and ARP as specified, 
(PFC) + 1 — PFC 
(dma) — (dma) + 1 
(repeat counter) — 1 — repeat counter. 
Else (ACC) + (shifted P register) — ACC, 
(dma) — TREGO 
(dma) x (pma, addressed by PFC) — P register 
(dma) — (dma) + 1 
Modify AR(ARP) and ARP as specified. 
(MCS) — PFC 


Affected by OVM and PM; affects C and OV. 


The MADD instruction multiplies adata memory value (specified by the dma) by aprogram 
memory value. The program memory address is contained in the BMAR register; it is not 
specified by a long immediate constant. This facilitates dynamic addressing of coefficient 


‘tables. In addition, the previous product, shifted as defined by the PM status bits, is added 


to the accumulator. The data and program memory locations on the TMS320C5x may be 
any nonreserved, on-chip or off-chip memory locations. If the program memory is block 
BO of on-chip RAM, then the CNF bit must be set to one. When MADD instruction is used 
inthe direct addressing mode, the dmacannot be modified during repetition of the instruc- 
tion. If MADD addresses one of the memory-mapped registers or external memory as a 
data memory location, the effect of the instruction will be that of a MADS instruction (see 
the DMOV instruction description). 


MADD functions in the same manner as MADS, with the addition of data move for on-chip 
RAM blocks. Otherwise, the effects are the same as for MADS. This feature makes MADD 
useful for applications such as convolution and transversal filtering. 


Assembly Language Instructions 


Syntax 
Operands 


Opcode 


Execution 


Description 
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MADS Multiply and Accumulate With Dynamic Addressing 


Direct: [label .MADS dma 
Indirect: [label MADS {ina} [,next ARP| 


O0<dma< 127 
0 < next ARP <7 


15. 14:13 :#12 ~#11:°=10 9 8 7 6 5 4 3 2 1 =O 
Direct! |} 1 0 1 0 4 O 1 Of 0] Data Memory Address 
is u 8 7 
Indirect: 0 = Sica 2 
(PC) +1 — PC 
(PFC) — MCS 


(BMAR) > PFC 


If (repeat counter) # 0: 
Then (ACC) + (shifted P register) — ACC, 
(dma) — TREGO — 
(dma) x (pma, addressed by PFC) — P register, 
Modify AR(ARP) and ARP as specified, 
(PFC) + 1 — PFC 
(repeat counter) — 1 — repeat counter. 
Else (ACC) + (shifted P register) — ACC, 
(dma) — TREGO 
(dma) x (pma, addressed by PFC) —> P register, 
Modify AR(ARP) and ARP as specified, 
(MCS) — PFC 


Affected by OVM and PM; affects C and OV. 


The MADS instruction multiplies a data memory value (specified by dma) by a program 
memory value (specified by pma). It also adds the previous product, shifted as defined 
by the PM status bits, to the accumulator. The pma is specified by the contents of the 
BMAR register, rather than by a long immediate constant. This allows for dynamic ad- 
dressing of coefficient tables. 


The data and program memory locations on the TMS320C5x may be any nonreserved, 
on-chip or off-chip memory locations. If the program memory is block BO of on-chip RAM, 
then the CNF bit must be set to one. When MADS is used in the direct addressing mode, 
the dma cannot be modified during repetition of the instruction. 


When the MADS instruction is repeated, the program memory address contained in the 
PFC is incremented by one during its operation. This makes it possible to access a series 
of operands in memory. MADS is useful for long sum-of-products operations because this 
instruction becomes a single-cycle instruction, once the RPT pipeline is started. 


Assembly Language Instructions 


MAR Modify Auxiliary Register 


LILO LLL LODE ELLIE LLL OLEATE LISD ELLIE GDS LE ELIE ALLELES LAS ESSER I D IDLE LONGEST ICEL GL ES. 


Syntax Direct: [label MAR dma 
Indirect: [label]) MAR {ind} [,next ARP] 
Operands (Q<nextARP <7 


Opcode 
15 14 13 12 11 10 9 8 7 6 5 4 3 2 +1 #20 


Direct! } 1 0 0 O 1 0 1 #1404 Data Memory Address 


15 14 13 12 11: #10 9 8 7 6 5 4 3 2 i 0 


Indirect: | 1 0 0 0 1 O 1 1 i ae See Subsection 4.1.2 
Execution (PC)+1— PC 


Modifies ARP, AR(ARP) as specified py the indirect addressing fiela. Acts as a NOP in 
direct addressing mode. 


Description \nthe indirect addressing mode, the auxiliary registers and the ARP are modified; howev- 
er, no use is made of the memory being referenced. MAR is used to modify the auxiliary 
registers or the ARP. The old ARP is copied to the ARB field of the status register ST1. 
Note that any operation that MAR performs can also be performed with any instruction that 
supports indirect addressing. ARP can also be loaded by an LST instruction. The instruc- 
tion LARP from the TMS320C25 instruction set is a subset of MAR (i.e., MAR *,4 performs 
the same function as LARP 4). 


Words 1 
Cycles 1 


Example 1 MAR *,AR1L ;Load the ARP with 1. 
Before Instruction After Instruction 


ARP [ ARP be 
ARB ARB Eee) 


Example2 MAR *+,AR5 ;Increment current auxiliary register 
; (AR1) and load ARP with 5. 


_ Before Instruction After Instruction — 
ARI ARI 


ARP ARP [ 
ARB — ARP 


4-116 Assembly Language Instructions 


MPY Multiply _ 


SESS SESS EET L ELS SL I EG LES LOE LESESILESE ISLES SESE SESE SRS EESLEL SEES ESSE SELES SES ESSE SEE ESE SE EEE SES USSESES SESE S ES SAG i 6 arden het eornencneneincndddd 


eyeles 1 (Direct, indirect, or short immediate addressing) 
2 (Long immediate addressing) | 


Example 1 py DAT13 ; (DP = 8) 


Before Instruction After Instruction 
Data Memory Data Memory 
40Dh 40Dh 
TREGO TREGO 
P P 2Ah 
Example2 py  +*,AR2 | 
Before Instruction After Instruction 
ARP =a) ARP aa 
ARI ARt 
Data Memory Data Memory 
40Dh 40Dh 
TREGO TREGO 
P P 2Ah 
Example3 Mpy #031h 
Before Instruction After Instruction 
TREGO [CHT TREGO 
P P 62h 
Example4 mpy #01234h 
Before Instruction After instruction 
TREGO CH TREGO 
P p 


4-118 Assembly Language Instructions 


MPYS Multiply and Subtract Previous Product 


Nts SNL OTIS LLL LLL LTT LLL TL ESL LLL SATS LENE PPE LLL LLLP ELIE LE DLL LLL LLL LA LEE LL PAL ELL EEL LATELY PALL ILLITE LIED ID GOVE LLLLIG. 


Syntax Direct: [label MPYS dma 
Indirect: [label] MPYS {ino} [,next ARP| 


Operands Q<dmas 127 
0 < next ARP < 7 


Opcode 
15 14 13 12 11 #10 9 8 6-6». 4 8 (2 - Ti: 0 


7 
Direct | 0 1 0 1 0 0 0 11]0] Data Memory Address 
7 


15 14 13 12 11 10 9 8 6 5 4 3 2 1 0 


0 1 0 0 0 1] 1 See Subsection 4.1.2 


—_ 


Indirect: | 0 


Execution (PC)+1 — PC 
(ACC) — (shifted P register) — ACC 
(TREGO) x (dma) — P register 


Affected by OVM and PM; affects C and OV. 


Description Thecontents of TREGO are multiplied by the contents of the addressed data memory loca- 
tion. The result is placed in the P register. The previous product, shifted as defined by the 
PM status bits, is also subtracted from the accumulator, and the result is placed in the ac- 


cumulator. 
Words 1 
Cycles 1 ; 
Example? mMPyYS DAT13 ; (DP = 6, PM = 0) 
Before Instruction After Instruction 
Data Memory Data Memory 

30Dh 30Dh 

TREGO TREGO 

P P 

ACC 54h ACC 1Eh 

Cc Cc 


Example2 MPYS *,AR5 ;(PM = 0) 
Before Instruction After Instruction 
. ARP 
AR4 30D 


ARP 
AR4 


Data Memory Data Memory 
30Dh 30Dh 


TREGO TREGO 
P 
ACC 

Assembly Language Instructions 
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Syntax [label] NEG 
Operands None 


Opcode 


15 14 13 #12 #11 
1-2 OP te TA 


Execution (PC)+1 —» PC 
(ACC) x -1 > ACC 


Affected by OVM; affects OV and C. 


Description The contents of the accumulator are replaced with its arithmetic complement (twos com- 
plement). The OV bit is set when taking the NEG of 80000000h. If OVM = 1, the accumula- 
tor contents are replaced with 7FFFFFFFh. If OVM=0, the result is 80000000h. The carry 
bit C on the TMS320C5x is reset to zero by this instruction for all nonzero values of the 


accumulator, and is set to one if the accumulator equals zero. 


| Words 1 
Cycles 1 


Example 1 NEG 


Example2 NEG 


Example3 NEG 
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3; (OVM = X) 
Before Instruction 


ACC 
Cc 


OV 


3 (OVM = 0) 
Before Instruction 


ACC 
Cc 


OV 


7; (OVM = 1) 


Before Instruction 


ACC 080000000h 
Cc 


OV 


ACC 


ACC 


ACC 


After Instruction 


[o] L_____ opt] 
C 
ov 


After Instruction 


[o] L___o8ooo0000H) 
Cc 


OV - 
After Instruction 
[o] 7FFFFFFEh 
Cc 


OV 


Assembly Language Instructions 


NOP _ No Operation 
Soesnceatenainnseoeeenens eee Se eee ES EON SERS ENR 


Syntax [label] NOP 
Operands None 
Opcode . 


15 14 13 #12 #11:#10 9 8 7 6 5 4 3 2 +1 #9 
1 0 0 0 4 0 1 14 0 0 0 0 0 0 0 0 


Execution (PC)+1— PC 


Description No operation is performed. The NOP instruction affects only the PC. The NOP 
instruction is useful to create pipeline and execution delays. 


Words 1 
Cycles 1 
Example NOP ;No operation is performed. 


4-124 Assembly Language Instructions 


RARER, 


NORM Normalize Contents of Accumulator 


Example 1 NORM *+ 


Before Instruction After Instruction 
ARP ARP 
AR2 AR2 
ACC OFFFFFOO1 acc [0] OFFFEOO2h 
TC TC 
Example 2  31-Bit Normalization: 
MAR . *,ARI1 7;Use AR1 to store the exponent. 
LAR AR1, #0h 7Clear out exponent counter. 
LOOP NORM *+ 7One bit is normalized. 
BCND LOOP., NTC 7;If TC = O, magnitude not found yet. 
Example3  15-Bit Normalization: 
MAR *,ARI1 7;Use AR1 to store the exponent. 
LAR ARi,#0Fh ;Initialize exponent counter. 
RPT #14 715-bit normalization is specified (yielding 
ja 4-bit exponent and 16-bit mantissa). 
NORM *- 7NORM automatically stops shifting when 


;the first significant magnitude bit is found, 
;performing NOPs for the remainder of the 
;repeat loops. 


The method in Example 2 is used to normalize a 32-bit number and yields a 5-bit exponent 
magnitude. The method in Example 3 is used to normalize a 16-bit number and yields a 
4-bit magnitude. If the number requires only asmall amount of normalization, the Example 
2 method may be preferable to the Example 3 method. This is because the loop in Exam- 
ple 2 runs only until normalization is complete. Example 3 always executes all 15 cycles 
of the repeat loop. Specifically, Example 2 is more efficient if the number requires three 
or less shifts. If the number requires six or more shifts, Example 3 is more efficient. 


co SY 


Note: 


The NORM instruction may be used without a specified operand. In that 
case, any comments on the same line as the instruction will be interpreted 
as the operand. If the first character is an asterisk *, then the instruction will 
be assembled as NORM * with no auxiliary register modification taking 
place upon execution. Therefore, Tl recommends that you replace the 
NORM instructions with NORM *+ when you want the default increment 
modification. 


ee | 


4-126 | Assembly Language Instructions 


OPL _OR With DBMR or Long Immedi 


jate 


Example 1 oOPL DATI10 ; (DP=6) 
Before Instruction 
DBMR 
Data Memory 
30Ah 
Example2 oOPL #0FFFh,DAT10 ; (DP=6) - 
Before Instruction 


Data Memory 
30Ah 


Example3  OPL *,AR6 


ARP 
AR3 
DBMR 


Data Memory 
300h 


Example4 opL #1111h,*,AR3 


ARP 
AR6 


Data Memory 
306h 
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O00Th 


Before Instruction 


Before Instruction 


DBMR 


Data Memory 
30Ah 


Data Memory 
30Ah 


ARP 
AR3 
DBMR 


Data Memory 
300h 


ARP 
AR6 


Data Memory 
306h 


ie CE RR ASIEN OT TT LENT IS LC LLL ESL E LTE LLL OL LE ELIDA ELL LES ETE AL ELE LILES SEES STEELS, 


After Instruction 
OFFFO 


/ 


OFFF1 


After Instruction 


OFFF 


After Instruction 


300 
OFO 


OFF 


After Instruction 


306 


141Fh 


Assembly Language Instructions 


OR = _OOR With Accumulator 


RAISE RE ORES IETS E TS GD ETSI D SINE NE Mae NE ANTE a IN RIL AR SLIT LRN ALS TALLIED LIES LL EGET ES ELLE LEIS ELL LLSLLT SEA ALL LL ELLER LES DELLE EEE LL EELS SEALED ES LIEBELS AL SALSA ELEN I AELLLLSD ELLER DLL SALLIE, 


Words 1 (Direct or indirect addressing) 
2 (Long immediate addressing) 
Cycles 1 (Direct or indirect addressing) 
| 2 (Long immediate addressing) 
Example1 oR DAT8 =; (DP = 8) 
Before Instruction After Instruction 
Data Memory Data Memory 
408h 408h 
ACC ACC 
Cc Cc 
Example2 OR *, ARO 
Before Instruction After Instruction 
ARP eee ARP a) 
ARI ARI 
Data Memory Data Memory 
300h 300h 
ACC ACC 
C C 
Example3 OR #08111h, 8 
Before Instruction After Instruction 
ACC ACC 
C - C 


4-130 Assembly Language Instructions 


Syntax 


Operands 


Opcode 


Execution 


Description 


Words 
Cycles 


Example 1 


Example 2 
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Direct: [label] OUT dma, PA 
Indirect: [/abel] OUT {ind}, PA [,next ARP] 


0<dma< 127 
0 < next ARP <7 
0< PAs 65535 


15 14 13 «12 ~=O11 = : 


: 
Sieer 0 0 0 0 : cae a ae Ree 


16-Bit Constant 


15 14°13 12 #11:°10 9 8 7 6 5 4 3 2 i O 


0 0 O 0 1 1 0 O { See Subsection 4.1.2 
cee ae ae es 
16-Bit Constant 


(PC) +2 — PC 

While (repeat counter) + 0 
Port address — address bus A15—-A0 
(dma) — Data bus D15—DO 
Port address + 1 —> Port address 
(repeat counter —- 1) — (repeat counter) 


The OUT instruction writes a 16-bit value from a data memory location to the specified 
/O port. The IS line goes low to indicate an I/O access, and the STRB, R/W, and READY 
timings are the same as for an external data memory write. Note that port addresses 
50h—5Fh are memory-mapped (see subsection 5.1.1); the other port addresses are not. 


RPT can be used with the OUT instruction to write consecutive words from data memory 
to I/O space. In the repeat mode, the port address (PA) is incremented after each access. 


2 
3 (Each output will increase by i, 1/O memory wait states. This is the num- 


ber of cycles the device must wait for external I/O devices to access 
data.) . 


OUT DATO,PA7 ;(DP = 4) Output data word stored in data memory 
zlocation 200h to peripheral on port address 7. 


OUT *, PALS 7Output data word referenced by current auxiliary 
;register to peripheral on port address 15. 


Assembly Language Instructions 


POP Pop Top of Stack to LowAccumulator 


Syntax 
Operands 
Opcode 


Execution 


Description 


Words 
Cycles 


Example 
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SIEGES SEE ITI OTIS LES IIIE ESERIES EEE IRIE NIA ELLE LOE SISNET EOE T SOS, 


[label POP 


None 


15 14 13 #12 11°10 9 8 7 6 5 4 3 2 i iO 
1 0 4 14 14 4 4-£0 0 0 4 4 0 0 74 #0 


(PC) +1 — PC 
(TOS) — ACC(15—0) 
0 — ACC(31-—16) 
Pop stack one level 


The contents of the top of the stack (TOS) are copied to the low accumulator, and the stack 
is popped after the contents are copied. The upper half of the accumulator is set to all ze- 
roes. 


The hardware stack is last-in, first-out with eight locations. Any time a pop occurs, every 
stack value is copied to the next higher stack location, and the top value is removed from 
the stack. After a pop, the bottom two stack words will have the same value. Because each 
stack value is copied, if more than seven stack pops (POP, POPD, RETC; RETE, RETI, 
or RET instructions) occur before any pushes occur, all levels of the stack contain the 
same value. No provision exists to check stack underflow. 


1 
1 


POP 
Before Instruction After Instruction 
ACC 


of 


Co] yoy | A] | + bh 
NY PQ] PRETO TN] [oO a1 
SID IS Loy ps i = 


Stack 


[¢] 
N 
=> 
o 
pet 
= 


ao 
pare 
= 


Assembly Language Instructions 


Example2 PoPpD *+,AR1 


ARP 
ARO 


Data Memory 
300h 


Stack 


4-136 


Before Instruction 


ARP 
ARO 


Data Memory 
300h 


Stack 


After Instruction 


301 


f SIT 1 
4} 1] IMT ip 
Sal (ai is J] ij 


Example2 PSHD 
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*, AR1L 


ARP 
ARO 


Data Memory 
1FFh 


Stack 


Before Instruction 


ARP 
ARO 


Data Memory 
1FFh 


Stack 


After Instruction 


1FF 


af] jou 
NOT TM] yy 
Paes ed J ji 


[e) 
i) 
3 


Oy yRTTOLIN 
O]JOTLIN] |OL] a 
Peewee. 


Assembly Language Instructions 


RETC _Return Conditionally i... 


Syntax 
Operands 


Opcode 


Execution 


Description 


Words 
Cycles 


Example 1 


Example 2 
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{label RETC[D] [cond7] [, cona2] [,... 


Conditions: ACC=0 EQ 
ACCx0 NEQ 
ACC<0 LT 
ACCs0 LEQ 
ACC>0 GT 
ACC20 GEQ 
C=0 NC 
C=1 C 
OV=0 . NOV 
OV=1 OV 
BIO low BIO 
TC=0 NTC 
TC=1 TC 
Unconditional UNC 

RETC: 


15 14 +13 a 10: .-9 


Zt ZING 


RETCD: 
15 14 #13 2 10. =9 


ZG ZO 


f (condition(s)) then 
(TOS) — PC 
Pop stack one level. 
Else, continue 


A standard return, RET, is executed if the specified conditions are met. Note that not all 
combinations of conditions are meaningful. The two one-word instructions or one 
two-word instruction following the RETC are fetched and executed before the execution 
of the return, if the delayed version is specified with the “D” suffix. If the delayed instruction 
is specified, the two instruction words following the RETCD instruction have no effect on 
the conditions being tested. 


1 


2 (Return not executed) 2 (If instruction delayed) 
4 (Return executed) 2 __ (If instruction delayed) 


RETC GEQ,NOV ;A return, RET, is executed if the accumulator contents are 
;positive and the OV bit is a zero. 


RETCD C ;A return, RET, is executed if the carry bit is set. The two 
MAR *,4 jinstructions following the return instruction will be 
LAR AR3, #1h ;executed before the return is taken. 


Assembly Language Instructions 


RET Return From Subroutine 


IS EE ATO LO I OTN OL EEL REET TTT LOTS ISDS ALLEL LASSIE DELETES IEEE ALELASL EELS EEE SEL SESSLER ESAT ASL LELL SLE ALESEEA NEE SSS SAE L EEL ERLE LENA NS ELLE SALES SEABED 


Example2  RETD 


MAR *,4 
LACC #1h 
Before Instruction After Instruction 
PC PC 
ARP ARP Ea 
ACC ACC 
Stack Stack 


~~“ 
ai 
zs 


F 


=] jo Ni 

sila) i> = 
O11 Ot ip} To] [po 
Sis isi [si is 


o 
m 
=a 


4-142 Assembly Language Instructions 


Syntax 
Operands 
Opcode 


Execution 


Description 


Words 


Cycles 
Example 


4-144 


SOS IE ESOT ELSES LIER RTI REIL IEE OEE EEO IE RSI I A I ET SI EL NE RNC TT I OES 


[label] RETI 
None 


15 14 13 #12 #11 #10 9 8 7 
1 1 #14 14 #14 #14 O 0 


oO 


(TOS) — PC 
Pop stack one level. 


The contents of the top stack register are copied into the program counter. The RETI in- 
struction also pops the values in the shadow registers (stored when an interrupt was tak- 
en) back into their corresponding strategic registers. The following registers are shad- 
owed: ACC, ACCB, PREG, STO, ST1, PMST, ARCR, INDX, TREGO, TREG1, and 
TREG2. The XF bit in status register ST1 is not saved or restored to/from the shadow reg- 
isters during interrupt service routines. 


1 
4 
RETI 
Before Instruction After Instruction 
PC PC 
Stack 37 Stack 
75h 


DLIOLIp| {ol ip 
SiO} io] oie 


oO 
m 
2 


Syntax [label] ROLB 
Operands None 


Opcode 


15 14 (13 12 11°10 9 8 7 6 5 4 3 2 i 0 
1 0O 4 14 #74 4 4 O O 


Execution (PC) +1 — PC 
C — ACCB(0) 
(ACCB(30-0)) — ACCB(31-1) 
(ACCB(31)) — ACC(0) 
(ACC(30-0)) — ACC(31-1) 
(ACC(31)) > C 


Affects C. 
Not affected by SXM. 


Description The ROLB instruction causes a 65-bit rotation. The contents of both the accumulator 
(ACC) and accumulator buffer (ACCB) are rotated to the left by one bit. The MSB of the 
original contents in the accumulator shifts into the carry position. The original value of the 
carry bit (C) shifts into the LSB position of the accumulator buffer, and the MSB of the origi- 
nal contents of the accumulator buffer shifts into the LSB position of the accumulator. 


Words 1 


Cycles 1 
Example ROLB 
Before Instruction After Instruction 
acc [1] acc [0] 
Cc C 
ACCB ACCB 


4-146 Assembly Language Instructions 


RORB_ Ao 


Syntax 
Operands 
Opcode 


Execution 


Description 


Words 
Cycles 


Example 


4-148 


tate ACCB and Accumulator Right 


7 PEELED RIE LOEE ERS PEE AEE L EE LLL DELILE LEED PEELE EDEL ELIE AP EELS LEE EEL LEELA ELI SEP SEEE TLE SEER 


[label RORB 


None 
15 14 13 12 11 10 9 8 7 6 5 4 3 2 41 #0 
1 0 1 1 1 1 1 0 0 0 0 1 0 1 0 1 
(PC) +1 — PC 
C — ACC(31) 


(ACC(31-1)) - ACC(30-0) 
(ACC(0)) — ACCB(31) 
(ACCB(31-1)) —» ACCB(30-0) 
(ACCB(0)) > C 


Affects C. 
Not affected by SXM. 


The RORB instruction causes a 65-bit rotation. The contents of both the accumulator 
(ACC) and accumulator buffer (ACCB) are rotated to the right by one bit. The LSB of the 
original contents in the accumulator buffer shifts into the carry position. The original value 
of the carry bit (C) shifts into the MSB position of the accumulator, and the LSB of the origi- 
nal contents of the accumulator shifts into the MSB position of the accumulator buffer. 


1 


{ 
RORB 
Before Instruction After Instruction 
Acc acc 
C c 
ACCB ACCB 


Assembly Language Instructions 


RPT RepeatNext Instruction = 


RPT is especially useful for block moves, multiply-accumulates, normalization, and other 
functions. The repeat instruction itself is not repeatable. 


Words 1 (Direct, indirect, or short immediate addressing) 
2 (Long immediate addressing) 
Cycles 2 
Example 1 RPT DAT127 ; (DP = 31) 
Before Instruction After Instruction 
Data Memory Data Memory 
OFFFh OFFFh 
RPTC RPTC 0Ch 
Example 2 RPT *, ARL 
Before Instruction After Instruction 
ARP eae ARP 
ARO ARO 
Data Memory Data Memory 
300h 300h OFFFh 
RPTC RPTC OFFFh 
Example3 RPT #1 ;Repeat next instruction 2 times. 
Before Instruction After Instruction 
RPTC RPTC 
Example4 Rpt #1111h ;Repeat next instruction 4370 times. 
Before Instruction After Instruction 
RPTC RPTC 


4-150 Assembly Language Instructions 


Syntax 
Operands 
Opcode 


Execution 


Description 


Words 
Cycles 
Example 


4-152 


Long immediate: [label] RPTZ #/k 
0 <|Iks 65535 


16 14 13 12 11:10 9 8 7 6 5 2 
1 0 174 14 14 4 4 0 4 4 OO O 0 4 O 1 


16-Bit Constant | 


0 — ACC 
0 — PREG 
(PC)+1— PC 


_Ik — RPTC 


The RPTZinstruction clears the accumulator and product register and repeats the instruc- 
tion following the RPTZ n times, where n = /k+7. RPTZ is equivalent to the following in- 
struction sequence: 


MPY #0 
PAC 
RPT #<1lk> 


2 
2 


RPTZ #7FFh #Zero product register and accumulator. 
MACD  pma, *+ 7;Repeat MACD 2048 times. 


Assembly Language Instructions 


SACH Store High Accumulator With Shift 


Syntax 


Operands 


Opcode 


Execution 


Description 


Words 
Cycles 
Example 1 


Example 2 


4-154 


Direct: [label] SACH dma |[,shiff| 
Indirect: [label] SACH {ind} [,shiff{,next ARP] 


0<dma< 127 
0 < next ARP <7 
O<shift< 7 (defaults to 0) 


Be te 6 5 4 3 2 1 =O 
Direct: [1001 1 | SHFF [0 | Data Memory Address 


14 13 #12 ii 10 Q9 


Indirect: Me eo SHF 7 en Se Specs 4.1 r 


T See Section 4.5. 


(PC)+1 — PC 
[(ACC) x 2shift] -, dma 


Not affected by SXM 


The SACH instruction copies the entire accumulator into a aeHiner where it left-shifts the 
entire 32-bit number from 0 to 7 bits. It then copies the upper 16 bits of the shifted value 
into data memory. The accumulator itself remains unaffected. 


1 
1 


SACH DAT10,1'° ; (DP = 4) 
Before Instruction After Instruction 


ACC Hi 4208001h ACC 4208001h 
Cc 


“Data Memory Data Memory 

20Ah 20Ah 
SACH *+,0,AR2 ; 
Before Instruction After Instruction 
ARP ARP Lt 
ARI ARI 
ACC ACC © 
Cc C 

Data Memory Data Memory 

300h 300h 


Assembly Language Instructions 


SAMM _ Store Accumulator in Memory-Mapped Register 


SSS SESE NSBSENSSBSRIDSRS NA KSSAASI NR i SELES SES SRS EM SR SA RAE Sain ki tid ti eee ca ktratnncedambe motes pccpees 


Syntax Direct: [label] SAMM dma 

Indirect: [label] SAMM {ind} [, next ARP] 
Operands (Q<dma<127 

0 <next ARP <7 
Opcode 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 O 
0 


Direct! |} 1 0 0 0 1 O 0 | 0 | Data Memory Address 


15 14 13 12 11 10 9 8 7 6 5 


4 3 2 1 0 
Indirect:| 10 0 0 1 0 0 0 Ex See Subsection 4.1.2 


Execution (PC)+1— PC 
(ACC) — dma(0-7) 

Description The low word of the accumulator is copied to the addressed memory-mapped register. 
The upper 9 bits of the data address are set to zero, regardless of the current value of DP 


or the upper 9 bits of AR(ARP). This instruction allows the accumulator to be stored to any 
memory location on data page 0 without modifying the DP field in status register STO. 


Words |. 1 

Cycles 1 (For processor memory-mapped registers) 
2 __— (For peripheral memory-mapped registers) 

Example 1 SAMM- PRD ; (DP = 6) 


Before Instruction After Instruction 

ACC ACC 

PRD PRD 
Data Memory Data Memory 

325h OFh 325h OFh 


Example2 SAMM *,AR2 ;(BMAR = 1Fh) 


Before Instruction After Instruction 
ARP ARP ae) 
AR7 AR7 
ACC ACC 
BMAR BMAR 
Data Memory Data Memory 
31Fh 1th 31Fh 11h 


4-156 Assembly Language Instructions 


SATH _ Barrel Shift ACC as Specified by TREG 


Syntax 
Operands 


Opcode 


Execution 


Description 


Words 
Cycles 
Example 1 


SATH 
None 


15. 14°13 #12 11°10 9 8 7 6 5 4 3 2 1 0 
1 0 14 74 74 4 4 0 0 4 0 4 4 O 14 0 


(PC) +1 — PC 
16 x (TREG1(4)) — count 
(ACC) right-shifted by count — ACC 


Affected by SXM. 


The accumulator is barrel-shifted right by 16 bits if bit 4 of TREG1 is aone. If bit 4of TREG1 
is a zero, the accumulator is unaffected. Zeroes are shifted in if SXM=0. Copies of 
ACC(31) are shifted in if SXM=1. The SATH instruction in conjunction with the SATL in- 
struction allows a 2-cycle 0- to 31-bit right shift. The carry bit is unaffected. 


1 
1 


SATH ; (SXM = 0) 


Before Instruction After Instruction 
ACC ACC 

Cc C 
TREG1 TREG1 

Example2 SATH- ; (SXM = 1) 

, Before Instruction After Instruction 
ACC ACC 

Cc Cc 
TREGI1 TREG1 


4-158 


Assembly Language Instructions 


SBB Subtract ACCB From Accumulator a 


Syntax 
Operands 
Opcode 


Execution 


Description 


Words 
Cycles 


Example 


4-160 


OD TELE ELLOS OSLO LOS LLL OS VOLES IOLA IOP LALIT LS LES LDL OL ADIT LILI EEL LER ETL DETALLES IIOR 


[label] SBB 
None 
15 14 13 12 11109 8 7 6 § .4 3 2 1 9QO 
1.0 1 1 1 1 1 0 ¢) 0 0 1 1 (0) 0 0 
(PC)+1— PC 


(ACC) —(ACCB) + ACC 


The contents of the accumulator buffer (ACCB) are subtracted from the contents of the 
accumulator. The result is stored in the accumulator, and the accumulator buffer is not af- 
fected. The carry bit is reset to zero if the result of the subtraction generates a borrow. 


1 
1 


SBB 
Before Instruction After Instruction 


ACG 20000000h acc [1] 10000000h 
Cc Cc 
ACCB 70000000h ACCB 70000000h 


Assembly Language Instructions 


Syntax [label SBRK #k 
Operands Q<k<255 


Opcode | 
15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 #90 


Short: | 0 1 1 1 1 1 0 6«~O 8-Bit Constant 


Execution (PC)+1— PC 
AR(ARP) — 8-bit positive constant — AR(ARP) 


Description The 8-bit immediate value is subtracted, right-justified, from the currently selected auxilia- 
ry register with the result replacing the auxiliary register contents. The subtraction takes 
place in the ARAU, with the immediate value treated as a 8-bit positive integer. 


Words 1 
Cycles 1 
Example SBRK #0FFh 


Before Instruction After Instruction 
ARP ARP 
AR7 AR7 OFFOth 


4-162 Assembly Language Instructions 


stl AAR LA es 


St LEER LEER EELE EES ES ELLE ALLL ELLE EIEN LEE ELELLE SDE ELLEN ELLER EES ENE SE REE AEE 


Example SETC. TC. 39C is bit ‘11 of STI 
Before Instruction After Instruction 


ST ST1 


4-164 Assembly Language Instructions 


SFLB  ShiftACCB and Accumulator Left 


Syntax [labell SFLB 
Operands None 
Opcode 


15 14 13 #12 ~#11:°10 9 8 7 6 5 4 3 2 +1 +O 
1 0 14 4 14 4 4 0 0 0 0 4 0 4 +4 =O 


Execution (PC)+1— PC 
0 — ACCB(0) 
(ACCB(30-0)) — ACCB(31-1) 
(ACCB(31)) —- ACC(0) 
(ACC(30-0)) + ACC(31-1) 
(ACC(31) — C 


Affects C. 
Not affected by SXM bit. 


Description The SFLB instruction shifts the concatenation of the accumulator (ACC) and accumulator 
buffer (ACCB) left by one bit position. The least significant bit of the accumulator buffer 
is filled with a zero, and the most significant bit of the accumulator buffer is shifted into the 
least significant bit of the accumulator. The most significant bit of the accumulator is shifted 
into the carry bit (C). The SFLB instruction is unaffected by SXM. 


Words 1 
Cycles | 
Example SFLB 
Before Instruction After Instruction 
ACC | 0B0001234h] ACC 60002469h 
Cc Cc 


ACCB 0B0001234h ACCB 60002468h 


4-166 | Assembly Language Instructions 


Syntax [labe] SFRB 
Operands None 


Opcode 


15 14 (13 12 11:10 9 8 7 6 5 4 3 2 i 0 
1 0 14 4 14 4 4 0 0 0.0 74 0 4 1 = 1 


Execution (PC)+1— PC 


If SXM=0: 
Then 0 — ACC(31) 
If SXM=1: 
Then (ACC(31)) -» ACC(31) 


(ACC(31—1)) —» ACC(30-0) 
(ACC(0)) —» ACCB (31) 
(ACCB(31—1)) —» ACCB(30-0) 
(ACCB(0)) > C 


Affects C. 
Affected by SXM. 


Description The SFRB instruction shifts the concatenation of the accumulator (ACC) and accumulator 
buffer (ACCB) right by one bit position. The LSB of the ACCB is shifted into the carry bit. 


If SXM=1, the instruction produces an arithmetic right shift. The sign bit (MSB) of the accu- 
mulator is unchanged and is also copied into bit 30. Bit 0 of the accumulator buffer is 
shifted into the carry bit (C). 


If SXM=0, the instruction produces a logic right shift. All of the accumulator and accumula- 
tor buffer bits are shifted right by one bit. The least significant bit of the accumulator buffer 
is shifted into the carry bit, and the most significant bit of the accumulator is filled with a 


zero. 
Words 1 
Cycles | 
Example1 SFRB~ ; (SXM = 0) 
Before Instruction After Instruction © 
ACG 080001235h acc. [0] _5800091Ah 
Cc C 
ACCB 0B0001234h ACCB 0D800091Ah 
Example2 SFRB- ; (SXM = 1) 
Before Instruction After Instruction 
ACC ___0B0001234h acc [o] 0D800091Ah 
Cc Cc 
ACCB 0B0001234h ACCB 05800091 Ah 


4-168 Assembly Language Instructions 


Example2 sMMR- *,#307h,AR6 ;(CBCR = 1Eh) 


Before Instruction After Instruction 

ARP aa ARP Cs Ld 

ARG AR6 
Data Memory Data Memory 

307h 307h 

CBCR CBCR 


4-170 Assembly Language Instructions 


SPH _ Store High P Register 


Syntax 
Operands 


Opcode 


Execution 
Description 


Words 
Cycles 
Example 1 


Example 2 


4-172 


5 


CORE A AAA A A LEE EEA IN AI NIA NLS A RELL ES IEE RELE SEE IER PLES SERIES EINER ELLE LEASES REALS EA NIE NSN IRA K ARIE SERRA ELIE CE ha aa ea i el AE te ln a tte: 


Direct: [label SPH dma 
Indirect: [label] SPH {ina} [,next ARP| 


0<dma< 127 
0 < next ARP < 7 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 +1 =#=20 
Direct: | 1 0 0 0 1 1 0 1 Data Memory Address 


0 | 
15 14 13 12 11 10 9 8 7 6 § 4 3 2 1 =O 
indirect: [4 0 0 0 4 1 0 1|1t]|  SeeSubsecion4i2 
(PC) +1 — PC 
(P register shifter output (31-16)) —> dma 
Affected by PM. 


The high-order bits of the P register, shifted as specified by the PM bits, are stored in data 
memory. Neither the P register nor the accumulator is affected by this instruction. High-or- 
der bits are sign-extended when the right-shift-by-6 mode is selected. Low-order bits are 
taken from the low P register when left shifts are selected. 


1 
1 
SPH DAT3 ; (DP = 4, PM = OQ). 
Before Instruction After Instruction 
P QFE079844h P OQFE0O79844h 
203h 4567h 203h OFEO7h 
SPH *,AR7T ; (PM = 2) 
: Before Instruction After Instruction 
ARP rT: ARP 
AR6 ARG 203h 
P OFE079844h P OFE079844h 
Data Memory Data Memory 
203h 4567h 203h ___0E079h 


Assembly Language Instructions 


Syntax Direct: _—‘[/abel] SPLK #/k,dma 
Indirect: [label] SPLK #Ik, {ina} [,next ARP 


Operands (Q<dma< 127 


0< next ARP <7 
lk: 16-bit constant 


Opcode 


15 14.113 12 11 °:10 9 8 


7 6 5 4 3 2 1 0 
1 O 0: | Data Memory Address 


1 OO 17 1 
Direct: 


15 14 (13 ~=12 ~— 11 S10 


Indirect: 


Execution (PC)+2 — PC 
Ik — dma 


1 
16-Bit Constant 


9 8 7 6 § 4 3 2 | 0 
ee See Subsection 4.1.2 
16-Bit Constant 


Description The SPLK instruction allows a full 16-bit pattern to be written into any memory location. 
The parallel logic unit (PLU) supports this bit manipulation independently of the ALU so 


that the ACC is unaffected. 
Words 2 
Cycles 2 
Example? sSPLK #7FFFh,DAT3 ; (DP = 6) 


Before Instruction 


Data Memory 
303h OFEO7h 


Example2 sSPLK #1111h,*+,AR4 
Before Instruction 


ARP 

AR4 
Data Memory 

300h 


4-174 


Data Memory 
303h 


ARP 
AR4 


Data Memory 
300h 


After Instruction 


7FFFh 


After Instruction 


301 


1111 


Assembly Language Instructions 


Syntax 


Operands 


Opcode 


Execution 


Description 


Words 


Cycles 
Example 1 


Example 2 


4-176 


Direct: 
Indirect: 


[label SQRA dma 
[label] SQRA {ind} [,next ARP] 


O<dma< 127 
0 <s next ARP < 7 


16 14 13 12 11 1 


7 6 § 4 3 2 1 =O 


Direct: | 0 1 0 14 0 0 141 #Of 0] Data Memory Address 


15 14 #13 #12 #+(141 =«10 


Indirect: } O 1 #0 141 0 0 4 #0 


(PC) +1 — PC 

(ACC) + (shifted P register) —- ACC 
(dma) — TREGO 

(dma) x (dma) — P register 


Affects OV and C. 
Affected by PM and OVM. 


7 
1 


6 5 4 3 2 1 =O 
See Subsection 4.1.2 


The contents of the P register, shifted as defined by the PM status bits, are added to the 


accumulator. The addressed data memory val 
stored in the P register. 


1 
1 


ue is then loaded into TREGO, squared, and 


SQRA DAT30 ;(DP = 6, PM = 0). 
Before Instruction After Instruction 
Data Memory Data Memory 
31Eh 31Eh 
TREGO CH TREGO OFh 
P P 
ACC acc [0] 
Cc Cc 
SQRA *,AR4 ;(PM = 0). 
Before Instruction After Instruction 
ARP aaa! ARP ee ae 
AR3 AR3 
Data Memory Data Memory 
31Eh 31Eh 
TREGO CH TREGO OFh 
P p 
ACC acc. [0] 
Cc C 


Assembly Language Instructions 


Syntax 


Operands 


Opcode 


Execution 


Description 


Words 
Cycles 
Example 1 


4-178 


Direct: [labell SST #n, dma 
Indirect: [label] SST #n, {ind} [,next ARP 


0<dma< 127 
n=0,1 
0 < next ARP <7 


Store Status RegisterO SST#0 
15 14 13 12 11 #10 7 6 5 4 3 2 1 OQ 


9 8 
Direc: | 1 0 0 0 1 1 #1 #Of 0) Data Memory Address 


15 14 #13 #12 11:10 9 8 7 6 5 4 3 2 1 O 
Indirect!| 1 0 0 0 141 1 #14 #Of 1) See Subsection 4.1.2 


Store Status Register1 SST#1 
156 14 13 12 11 10 9 8 7 6 5 4 3 2 1 =90 


Direct: | 1 0 0 0 141 1 #4 #=140' Data Memory Address 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 i1 90 


Indrect:| 1 0 0 0 4 14 #1 =#<4/4 1) See Subsection 4.1.2 


(PC) +1 — PC 
(status register STn) > dma 


Status register STn is stored in data memory. In the direct addressing mode, status regis- 
ter STnis always stored in page 0, regardless of the value of the DP register. The proces- 
sor automatically forces the page to be 0, and the specified location within that page is 
defined in the instruction. Note that the DP register is not physically modified. This allows 
storage of the DP register in the data memory on interrupts, etc., in the direct addressing 
mode without having to change the DP. In the indirect addressing mode, the data memory 
address is obtained from the auxiliary register selected (see the LST instruction for more 
information). Inthe indirect addressing mode, any page in data memory may be accessed. 


Status registers STO and ST1 are defined in subsection 3.6.3, Status and Control Regis- 
ters. 


1 
1 


SST #0,DAT96 ; (DP = 6) . 
Before Instruction After Instruction 


STO OA408h STO OA408h 
Data Memory Data Memory 
60h 60h OA408h 


Assembly Language Instructions 


Syntax Direct: [label] SUB dmal{,shiftT] 
Indirect: [label SUB {ind} [,shift7 [,next ARP] 
Short Immediate: [/abel SUB #k 
Long Immediate: [/abel] SUB #/k[,shift2] 


Operands (Q<dmas<127 
O<shifti<16 (defaults to 0) 
0 < next ARP < 7 
0<k<255 
—32768 < Ik < 32767 
O<shift2<15 (defaults to 0) 


Opcode 
Subtract from accumulator with shift 
15 14 13 12 11109 8 7 6 5 4 3 2 14 9O 


Direct: [001 1] SHET? [0] Data Memory Address 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
Indirect | 0 0 1. 1 SHFT T Pate,| See Subsection 4.1.2 


Subtract from accumulator with shift of 16 
15 14 13 12 11 10 9 8 6. -5 4. 3 2 2 Tt. 20 


7 
Direct: | 0 1 1 0 0 1 0 411]0] Data Memory Address 


15 14 13 12 11109 8 7 6 5 4 3 2 1 Q 
Indirect:} O 1 1 0 0 17 OO 1 1 See Subsection 4.1.2 


Subtract from ACC short immediate 
15 14 13 12 11 10 9 8 7 6 5 4 3 2 #4 0 
Shor:| 1 0 1 #4 14 0 4 #@Q. 8-Bit Constant 
Subtract from ACC long immediate with shift | 
15 14 13 12 11:°#10 «9 8 7 6 5 4 3 2 1 0 


1 0 14 74 4 #74 4 4 4 O 14 0 SHFT Tt 
Long: 
i 16-Bit Constant 


Tt See Section 4.5. 
Execution Direct or Indirect Addressing: 


(PC)+1— PC 

(ACC) —[(dma) x 2shiftt] —, ACC 
Affects C and OV. 

Affected by SXM and OVM. 


_ Short Immediate Addressing: 


(PC) +1 — PC 
(ACC) -k > ACC 


4-180 . Assembly Language Instructions 


SUB Subtract From Accumulator 


Example 4 suB #OFFFh,4 ;(SXM = 0) 
Before Instruction After Instruction 


ACC | OFFFFh Acc [1] OFh 
Cc , 


4-182 . Assembly Language Instructions 


Syntax 


Operands 


Opcode 


Execution 


Description 


Words 
Cycles 


4-184 


Direct: [labe] SUBC dma 
Indirect: [label] SUBC {ind} [,next ARP| 


0<dma< 127 
0 < next ARP < 7 


15 14 13 12 11:10 9 8 7 6 5 4 3 2 1 0 
0 


Direct: | 0 0 0 0 1 0 1 | 0 Data Memory Address 


15.14 13 #12 #11 :°10 9 8 7 6 5 4 
0 O 0 1 0 


3 2 1 6) 
0 0 1 See Subsection 4.1.2 


Indirect: 


(PC)+1— PC 
(ACC) —[(dma) x 215] — ALU output 


lf ALU output = 0: 
Then (ALU output) x2 +1 — ACC; 
Else (ACC) x2 — ACC. 


Affects OV and C. 
Affected by SXM. 
Not affected by OVM (no saturation) . 


The SUBC instruction performs conditional subtraction, which may be used for division. 
The 16-bit dividend is placed in the low accumulator, and the high accumulator is zeroed. 
The divisor is in data memory. SUBC is executed 16 times for 16-bit division. After comple- 
tion of the last SUBC, the quotient of the division is in the lower-order 16-bit field of the 
accumulator, and the remainder is in the higher-order 16-bits of the accumulator. SUBC 
assumes that the divisor and the dividend are both positive. The SXM bit will affect this 
operation. If SXM=1, then the divisor must have a0 value in the MSB. If SXM=0, then any 
16-bit divisor value will produce the expected results. The dividend, which is in the accu- 
mulator, must initially be positive (i.e., bit31 must be 0) and must remain positive following 
the accumulator shift, which occurs in the first portion of the SUBC execution. 


If the 16-bit dividend contains fewer than 16 significant bits, the dividend may be placed 
in the accumulator and left-shifted by the number of leading nonsignificant zeroes. The 
number of executions of SUBC is reduced from 16 by that number. One leading zero is 
always significant. 


Note that SUBC affects OV but is not affected by OVM, and therefore the accumulator 
does not saturate upon positive or negative overflows when executing this instruction. The 
carry bit is affected in the normal manner during this instruction. 


1 
1 


Assembly Language Instructions 


SUBS Subtract From or With Sign-Extension Suppressed 


Syntax Direct: [label SUBS dma 
Indirect: [labe] SUBS {ind} [,next ARP| 


Operands Q<dmas< 127 
0 <next ARP <7 


Opcode 
15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
0 


Direct} 0 1 14 #0 0 1 1 | 0 | Data Memory Address 


15 14 13 12 11:10 9 8 7 6 5 
1 1 


4 3 2 1 0 
Indirect: | O 1 1 0 6#«~O 0 See Subsection 4.1.2 


Execution (PC)+1— PC 
(ACC) — (dma) — ACC 


Affects OV and C; affected by OVM. 
Not affected by SXM. 


Description The contents of the specified data memory location are subtracted from the accumulator 
with sign extension suppressed. The data is treated as a 16-bit unsigned number, regard- 
less of SXM. The accumulator behaves as a signed number. SUBS produces the same 
results as a SUB instruction with SXM = 0 and a shift count of 0. 


Words 1 
Cycles 1 
Example? SUBS DAT2 ;(DP = 16, SXM= 1). 
Before Instruction After Instruction 
Data Memory Data Memory 
802h OQFO03h 802h OFO03h 


ACC OF105h ACC 
C Cc 


Example2 suBS * ;(SXM = 1) 


Before Instruction After Instruction 
ARP a ARP 
ARO ARO 
Data Memory Data Memory 
310h 310h 


ACC OFFFF105h ACC OFFFO102h 
Cc Cc 


4-186 Assembly Language Instructions 


Example 2  suBT 


* 


Before Instruction 


ARP re | 
ARi 800h 
Data Memory 


4-188 


a0ch 


TREG! 


ACC 
C 


: After Instruction 
ARP 


ARi 800h 
Data Memory 

800h 

TREG! (08h 


acc [o] OFFFFFFOOh 
Cc 


Assembly Language Instructions 


Example2 TBLR- *,AR7 


ARP 
ARO 
ACC 


Program Memory 
24h 


Data Memory 
300h 


4-190 


Before Instruction 


ARP 
ARO 


24h ACC | 
Program Memory 
307h 24h 
. Data Memory 
300h 


After Instruction 


300 


Ly) 
ms 
SIS TN 


307 


il 


307 


Assembly Language Instructions 


TB LW Table Write 


Example2 TBLW * 


ARP 
AR6 
ACC 


Data Memory 
1006h 


Program Memory 
258h 


4-192 


Before Instruction 


| 


ARP 
ARG 
ACC 

Data Memory 
1006h 


Program Memory 


258h 


After Instruction 


_1006h 
258 


4340 


4340 


Assembly Language Instructions 


XC Execute 


Syntax 
Operands 


Opcode 


Execution 


Description 


Words 
Cycles 


Example 


4-194 


OPO AAR 


Conditional) an 


[label] XC k [,cond71] [,condZ [.... 


k=1or2 

Conditions: ACC=0 EQ 
ACCz0 NEQ 
ACC<0 LT 
ACCs0 LEQ 
ACC>0 GT 
ACC20 GEQ 
C=0 NC 
C=1 C 
Ov=0 NOV 
OV=1 OV 
BIO low BIO 
TC=0 NTC 
TC=1 TC 
Unconditional UNC 

12 11 #10 #9 6 5 4 3 2 1 0 


cit NtLo 1] we? | aver | awet 


Tt See Section 4.5. 


If (condition(s)) 
Then next k instructions executed 
Else execute NOP’s for next k instructions 


If k= 2 andconditions are met, the one two-word instruction or two one-word instructions 
following the XC instruction execute. If k = 1 and conditions are met, the one-word instruc- 
tion following the XC instruction executes. If the conditions are not met, one or two NOPs 
are executed. Note that not all combinations of conditions are meaningful. The XC instruc- 
tion and two-instruction words following the XC are uninterruptible. 


1 
1 


XC 1,LEQ,C 
MAR *+ 
ADD DAT100 


If the accumulator contents are less than or equal to zero and the carry bit is set, the ARP 
is modified prior to the execution of the ADD instruction. 
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Example2 XOR 


Example 3  XOR 


DAT127 ; (DP = 511) 
Before Instruction 


Before Instruction 


11111010h 


After Instruction 


Data Memory 
OFFFFh OFOFOh 


ACC 1234A688h 
Cc 


After Instruction 


ARP ae) 


AR7 301h 
Data Memory 
300h OFFFFh 


ACC 12340F0Fh 
Cc 


After Instruction 


ACC 111E1F10h 
Cc 
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Syntax 


Operands 


Opcode 


Execution 


Description 


Words 
Cycles 
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Direct: [label XPL [#/k,] dma 
Indirect: [label XPL [#/k,] {ina} [,next ARP 


0 <dma < 127 
Ik: 16-bit constant 
0 < next ARP < 7 


XOR DBMR with data value | 
15 14 13 12 11 10:9 8 6 5 4 3 2 1 0O 


7 
Direc: | 0 1 0 1 #1 #0 0 0]f0| Data Memory Address 


15 14 #13 ~#12 ~«11:°:10 9 8 7 6 5 4 3 2 1 0 
Indirect: | 0 1 0 1 1 0 oO O 1 See Subsection 4.1.2 


XOR long immediate with data value 


15 14 13 #12 #11 :10 9 8 7 6 5 4 3 2 1 0 


Direct 0 10 14 14 #474 0 0 a Data Memory Address | 
irect: 
16-Bit Constant 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 OQ 
0 1 0 1 1 1 0 0 ti | — See Subsection 4.1.2 


16-Bit Constant 


Ik unspecified: 


Indirect: 


(PC) +1 — PC 
(dma) XOR (DBMR) — dma 


Ik specified: 


(PC) +2 — PC 
(dma) XOR Ik > dma 
Affects TC. 


If a long immediate constant is specified, it is XORed with the addressed data memory 
value. If it is not specified, the addressed data memory value is XORed with the contents 
of the dynamic bit manipulation register (DBMR). In either case, the result is written back 
into the specified data memory location, and the accumulator contents are not disturbed. 
If the result of the XOR operation is 0, then the TC bit is set to 1. Otherwise, the TC bit 
is set to 0. 


(Long immediate value not specified) 
(Long immediate value specified) 


(Long immediate value not specified) 
(Long immediate value specified) 


NMN— WM — 
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ZALR Zero Low Accumulator Load High Accumulator with Rounding 


WSS EOSESEE RIL IIE GIOIA EE RRR sD Keb 


Se SEEN REV LED TRESS IIIT IEEE LEI IIE ERED EERIE ITD SELINA IO AIT ER DTI A RII AOE ASIII TANI RIA IERIE IRATE IIIT LOA RN IIT 


Syntax Direct: [label] ZALR dma 
Indirect: [label] ZALR {ind} [,next ARP] 


Operands (Q<dma<127 
0 s next ARP <7 


Opcode 


15 14 13 12 11 #10 9 
Direct: } 0 1 #1 #0 #4 OO O 


7 6 5 4 3 2 1 0 


| 0 | Data Memory Address 


15 14 13 12 11:10 9 8 7 6 5 4 3 2 ji oO 


indirect: [0.1 1.0 1.0 0 0 Ea See Subsection 4.1.2 


Execution (PC)+1— PC 
8000h — ACC(15-0) 
(dma) — ACC(31-16) 


Description \norder to load adata memory value into the high-order half of the accumulator, the ZALR 
instruction rounds the value by adding 1/2 LSB; that is, the 15 low bits (bits 0-14) of the 
accumulator are set to zero, and bit 15 of the accumulator is set to one. 


Words 1 


Cycles 1 
Example1 ZALR DAT3~ ;(DP = 32) 
Before Instruction After Instruction 
Data Memory Data Memory 
1003h 41003h 
ACC ACC 
Cc Cc 
Example2 ZALR  *-,AR4 
Before Instruction After Instruction 
ARP ARP 
AR7 AR7 
Data Memory Data Memory 
OFFOOh OFFOOh 
ACC ACG 
Cc Cc 


4-200 | Assembly Language Instructions 


Syntax _ [label] _ZPR 
Operands None 


Opcode 
15 14 13 12 #11 +#10 
1 O 1 1 1 1 
Execution (PC)+1— PC 
0 — PREG 
Description The product register is set to zero. 
Words 
Cycles 1 


Example ZPR 
Before Instruction 


PREG 3F0111141h 
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PREG 


After Instruction 


00000000h 
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Table 4-5. 


4-204 


Mapping Summary (Continued) | 


Accumulator Memory Reference Instructions 
(Concluded) 


TMS320C2x Mnemonic TMS320C5x Mnemonic 
SUBC SUBC 


SUBH_ 


SUB 
8] 
oR 
: 
ch 
rR 
S 


ZAL 


SUBS 
SUBT 


S 


i 


Auxiliary Registers and Data Page Pointer Instructions 
TMS320C2x Mnemonic TMS320C5x Mnemonic 
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Table 4-5. 
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Mapping Summary (Continued) 


Branch/Call Instructions (Concluded) 


TMS320C5x Mnemonic 
Son 
sch 

BNV BCND 

BNZ BCND 


w 
< 


BCND 
BCND 
CALA 
CALL 
E RET 
TRAP | TRAP. 

1/0 and Data Memory Operations 


TMS320C2x Mnemonic 
BLEP 
APL 


| CALA 
CALL 


23 
- 


Cc 


T 


a 


RTXMt 


x< 


F CLRC 


OUT 
APL 
APL 
i 
sro 


” 
x 


TBLW TBLW 
t The suggested mapping requires that the data page pointer 
be set to 0. 
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Instruction Set Opcode Table 


This section contains a table that summarizes the opcodes of the instruction 
set forthe TMS320C5x digital signal processors. This instruction set is a super- 
set of the TMS320C1x and TMS320C2x instruction sets. The instructions are 
arranged according to function and are alphabetized within each category. 


The following symbols are used in the opcode table: 


SS 


Addressing mode bit. O= direct addressing mode 
1 = indirect addressing mode 


Short Immediate value 


INTR# Interrupt vector number 


Constant copied into PM bits in status register ST1. See SPM instruction 


ee Three-bit shift value 
SHFT Four-bit shift value. 


Field for the XC instruction indicating the number of instructions (one or two) to con- 
ditionally execute. 


Lo ‘Two bits used by the conditional execution instructions to represent the conditions 


TC, NTC, and BIO. 


Four-bit field representing the following conditions: 


S 
° 
. 
. 
. 
. 
. 


ZLVC 


Z: ACC=0 
L: ACC <0 
V: Overflow 


C: Carry 


A conditional instruction contains two of these four-bit fields. The four-LSB field of 
the instruction is a four-bit mask field. A one in the corresponding mask bit indicates 
that condition is being tested. The second four-bit field (bits 4 — 7) indicates the state 
of the conditions designated by the mask bits as being tested. For example, to test 
for ACC 20, the Z and L fields will be set, while the V and C fields are not set. The 
next four-bit field contains the state of the conditions to test. The Z field willbe appro- 
priately set to indicate to test the condition ACC = 0, and the L field will be reset to 
indicate to test the condition ACC greater than 0. The conditions that can be formed 
from these 8 bits are shown in the BCND, CC, and XC instruction set pages. In order 
to determine ifthe conditions are met, the four LSB bit maskis ANDed with the condi- 
tions. If any bits are set, the conditions are met. 


Indicates the instruction is a two-word instruction. The second word is a 16-bit long 
immediate value or a 16-bit program memory address for immediate addressing. 
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Instruction Set Opcode Table 


Table 4-6. 
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Opcode Summary (Continued) 


Accumulator Memory Reference Instructions (Concluded) 
[Instruction Mnemonio [Opodo 


Subtract from accumulator with shift SHFT IAAA AAAA 
Subtract from accumulator with shift of 16 0110 0101 IAAA AARA 

Subtract from ACC short immediate 1011. 1010 ITEI JIN 

Subtract from ACC long immediate with shift 1011 1111 1010 SHFT +1 word 
Subtract from accumulator with borrow 0110 0100 IAAA AAAA 
Conditional subtract 0000 1010 [AAA AAAA 

Subtract from ACC with sign suppressed 0110 0110 IAAA AAAA 

Subtract from ACC, shift specified by TREG1 0110 0111 IAAA AAAA 

XOR accumulator with data value 0110 1100 IAAA AAAA 

XOR with ACC long immediate with shift 1011 1111 1101 SHFT +1 word 
XOR with ACC long immediate with shift of 16 1011. 1110 1000 0011 +1 word 
XOR ACCB with accumulator 1011. 1110 0001 1010 

Zero ACC, load high ACC with rounding 0110 1000 IAAA AAAA 

Zero accumulator and product register 1011 1110 0101 1001 


Auxiliary Registers and Data Page Pointer Instructions 
[struction Mnemmonie [Oped 


Add to AR short immediate 0111 1000 IIII Jt 
Compare AR with CMPR 1011 1111 0100 01CM 

Load AR from addressed data 0000 OARX IAAA AAAA 

Load AR short immediate 1011 OARX IIIT TI 

Load AR long immediate 1011 1111 0000 1ARX +1 word 
Load data page pointer with addressed data 0000 1101 IAAA AAAA 

Load data page immediate 1014 1101 TIT Till 

Modify auxiliary register 1000 1011 IAAA AAAA 

Store AR to addressed data 1000 OARX IAAA AAAA 

Subtract from AR short immediate 0111 1100 HITT I) 


AND DBMR with data value 
AND long immediate with data value 
Compare DBMR to data value 
Compare data with long immediate 
OR DBMR to data value 

OR long immediate with data value 
Store long immediate to data 

XOR DBMR to data value 

XOR long immediate with data value 


+ 1 word 


+ 1 word 


0101 1101 IAAA AAAA +1 word 
+ 1 word 


+ 1 word 
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Instruction Set Opcode Table 


Table 4-6. 
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Opcode Summary (Continued) 


Branch Instructions 


[_siveson [aes [ope 


Branch unconditional with AR update 
Branch unconditional with AR update delayed 
Branch addressed by ACC 

Branch addressed by ACC delayed 

Branch AR = 0 with AR update 

Branch AR = 0 with AR update delayed 
Branch conditional 

Branch conditional delayed 

Call subroutine addressed by ACC 

Call subroutine addressed by ACC delayed 
Call unconditional with AR update 

Call unconditional with AR update delayed 
Call conditional 

Call conditional delayed 

Software interrupt 

Nonmaskable interrupt 

Return 

Return conditional 

Return conditionally, delayed 


Return, delayed 

Return from interrupt with enable 

Return from interrupt 

Trap 

Execute next one or two INST on condition 


Block move from data to data memory 

Block move data to data DEST long immediate 
Block move data to data with source in BMAR 
Block move data to data with DEST in BMAR 
Block move data to PROG with DEST in BMAR 
Block move from program to data memory 
Block move Prog to data with source in BMAR 
Data move in data memory 

Input external access 

Load memory mapped register 

Out external access 

Store memory mapped register 

Table read 

Table write 


0111 
1111 
1001 
1100 
1001 
0110 
0111 


1AAA AAAA 
1AAA AAAA 
0010 0000 
0010 0001 
1AAA AAAA 
1AAA AAAA 
ZLVC ZLVC 
ZLVC ZLVC 
0011 0000 
0011 1101 
1AAA AAAA 
1AAA AAAA 
ZLVC ZLVC 
ZLVC ZLVC 
0111 NTR# 
0101 0010 
0000 0000 
ZLVC ZLVC 
ZLVC ZLVC 
0000 0000 
0011 1010 
0011 1000 
0101 0001 
ZLVC ZLVC 


[AAA 
IAAA 
IAAA 
IAAA 
IAAA 
IAAA 
IAAA 


+ 1 word 
+ 1 word 


+ 1 word 
+ 1 word 
+ 1 word 
+ 1 word 


+ 1 word 
+ 1 word 
+ 1 word 
+ 1 word 


+ 1 word 
+ 1 word 


+ 1 word 


+ 1 word 
+ 1 word 
+1 word 
+ 1 word 
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5 | Peripherals 


5.1 


5.1.1 


Peripheral Control 


Peripheral Control 


Peripheral circuits are operated and controlled through access of memory- 
mapped control and data registers. The operation of the serial ports and timer 
is synchronized to the processor via interrupts or through interrupt polling. Set- 
ting and clearing bits can enable, disable, initialize, and dynamically reconfi- 
gure the peripherals. Data is transferred to and from the peripherals through 
memory-mapped data registers. When a peripheral is not in use, the internal 
clocks are shut off from that peripheral, allowing for lower power consumption 
when the device is in normal run mode or idle mode. 


Memory-Mapped Registers and I/O Ports 


Twenty-eight core processor registers are mapped into the data memory 
space. These are listed in subsection 3.4.1 of this user’s guide. In addition to 
these core registers, 15 peripheral registers and 16 I/O ports are mapped into 
the data memory space. Table 5~1 lists the memory-mapped registers and I/O 
ports of the TMS320C5x. Note that all writes to memory-mapped peripheral 
registers require one additional machine cycle. 


Table 5-1. Memory-Mapped Registers and I/O Ports 


5-2 


Memory-Mapped Core Processor Registers 


aC: et eA ST 
[5 [Global Memon Aoeaion Regier 
[8 erpt ag Regier 
co 
=e 


Repeat Counter Register 


0-3 
a 
fenecas 
Eee 
awe 


12 Cc 
Temporary Register Used for Dynamic Shift 
Temporary Register Used as Bit Pointer in Dy- 
TREG2 pom fe | namic Bit Test (4 bits only) 


[pana [75 [Ff [Branioanipaton Regier 
Paro [8 [10 [Aan Regiterzaro 
[ane [te [ 2 [Aan Register 
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Table 5-1. 
5.1.2 Interrupts 
5-4 


Memory-Mapped Registers and I/O Ports (Concluded) 


Description 


ik" 
[Bee [Fox 


Memory-Mapped I/O Ports 


ae Resend 
a 
A 
OO 

Oe 
a 
OS 
a 
a A 
as 
a 
a 
ee 
sf = 
| =| 
a ae 


The TMS320C5x devices have four external, maskable user interrupts 
(INT4—INT1) that external devices can use to interrupt the processor; there is 
one nonmaskable interrupt (NMI). Internal interrupts are generated by the seri- 
al port (RINT and XINT), by the timer (TINT), by the TDM port (TRNT and 
TXNT), and by the software interrupt instructions (TRAP, NMI, and INTR). In- 
terrupt priorities are set so that reset (RS) has the highest priority and the TDM 
port transmit interrupt (TXNT) has the lowest priority. The NMI effectively has 
the same priority as RS. 


This subsection explains interrupt organization and management. Vector rela- 
tive locations and priorities for all internal and external interrupts are shown in 
Table 5-2. No priority is set for the TRAP instruction (used for software inter- 
rupts), but itis included here because it has its own vector location. Each inter- 
rupt address has been spaced apart by two locations so that branch instruc- 
tions can be accommodated in those locations. 


The interrupt vectors reside at locations determined by the five-bit IPTR field 
of the PMST and the address values shown in Table 5-2. The IPTR field is set 
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15-9 8 7 6 5 4 3 2 1 0 
RESERVED INT4 TXNT TRNT XINT RINT TINT INT3 INT2~ INT4 


Note that the TMS320C50 and TMS320C51 make use of only ten of the sixteen 
generic interrupt lines to the core CPU shown in Section 3.8. 


A one in a specific bit, when read, indicates an active interrupt. For example, 
if the IFR is read to be 0005h, then INT3 and INT1 are active. A one can be 
written to a specific bit to clear the corresponding interrupt. In the example, if 
a one is written to bit zero (0001h to IFR), then the INT1 interrupt would be 
cleared. In the above example, the value 0005h could be written back into the 
IFR to clear both pending interrupts. 


A corresponding interrupt flag is automatically cleared when the interrupt trap 
is taken. When the CPU accepts the interrupt and fetches the instruction at the 
interrupt vector location, it generates an interrupt acknowledge (IACK) signal 
that clears the appropriate interrupt flag bit. A hardware reset (RS active low) 
clears all pending interrupt flags. 


The TMS320C5x devices have a memory-mapped interrupt mask register 
(IMR) for masking external and internal interrupts. The layout of the register 
is as follows: 


15-9 8 7 6 5 4 3 2 1 0 


A 1 in bit positions 8 through 0 of the IMR enables the corresponding interrupt, 
provided that INTM = 0. The IMR is accessible with both read and write opera- 
tions. Note that RS and NMI are not included in the IMR; the IMR has no effect 
on reset or a nonmaskable interrupt. 


Interrupts may be asynchronously triggered. In the functional logic organiza- 
tion for INT4—-INT1, shown in Figure 5—1, the external interrupt INTn is syn- 
chronized to the core via a five flip-flop synchronizer. The actual implementa- 
tion of the interrupt circuits is similar to this logic implementation. A one is 
loaded into the IFR if a 1-1-0-0-0 sequence on five consecutive CLKOUT1 
cycles is detected. 


The TMS320C5x devices sample the external interrupt pins multiple times to 
avoid noise-generated interrupts. To detect an active interrupt, these devices 
must sample the signal low on atleast three consecutive machine cycles. Once 
an interrupt is detected, the devices must sample the signal high on at least 
two consecutive machine cycles to be able to detect another interrupt. The ex- 
ternal interrupt pins are sampled on the rising edge of CLKOUT1. If the external 
interrupts are running asynchronously, the pulses should be stretched to guar- 
antee three consecutive low samples. 
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6) 


The two software wait-state registers are set to OFFFFh, causing all exter- 
nal accesses to occur with 7 wait states. The CWSR is loaded with OFh. 


The FO bits of the SPC and TSPC registers are set to zero, selecting a 
word length of 16 bits for each serial port. 


The FSM bits of the SPC and TSPC registers are set to zero. FSM must 
be set to one for operation with frame sync pulses. 


The TXM bits of the SPC and TSPC are set to zero, configuring the FSX 
and TFSX pins as inputs. 


The SPC and TSPC registers are loaded with OyOOh, where the 2 MSBs 
of y are 10 (binary) and the 2 LSBs of y reflect the current levels on the 
transmit and receive clock pins of the respective port. 


The TIM and PRD registers are loaded with OFFFFh. The TDDR field of 
the TCR is set to zero. The timer is started. 
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5.3 Software-Programmable Wait-State Generators 


5-10 


Software-programmable wait-state generators can be used to extend external 
bus cycles by up to 7 machine cycles. This provides a convenient means for 
interfacing external devices that do not satisfy the full-speed access-time re- 
quirements of the TMS320C5x. Devices requiring more than 7 wait states can 
be interfaced with the hardware READY line. When all external accesses are 
configured for zero wait states, the internal clocks to the wait-state generator 
are shut off, allowing the device to run in a lower power mode of operation. 


The software-programmable wait-state generators are controlled by two 16-bit 
wait-state registers (PDWSR and IOWSR) and a 5-bit control register 
(CWSR). Each of the three external spaces (program, data, and I/O spaces) 
has an assigned field in a software wait-state register. Wait states for the pro- 
gram and data spaces are specified in the lower and upper halves of PDWSR, 
respectively. Wait states for I/O space are specified in IOWSR. The bits of 
CWSR control the mapping between wait-state register contents and the num- 
ber of wait states. 


The program and data spaces each consist of 64K addresses. Each 64K space 
can be viewed as being composed of four 16K-word blocks. Each 16K address 
segment in program and data space is associated with 2 bits in PDWSR, as 
shown in Table 5—3. The value of a 2-bit field in PDWSR specifies the number 
of wait states to be inserted for each access in the given space and address 
range. 
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Table 5-4. Table 5-5 shows the layout of the CWSR register in PDWSR and 
lIOWSR registers. You should always program the CWSR register prior to con- 
figuring the PDWSR and IOWSR registers to avoid configuring memory with 
too few wait states during the set-up of wait-state registers. 


Table 5—4. Mapping Between Wait-State Field Values and # of Wait States as a Function of CWSR Bit n 


Wait-State Field No. of Wait States No. of Wait States 
of PDWSR or l|OWSR (CWSR Bit n = 0) (CWSR Bit n = 1) 
(Binary Value) 


Table 5-5. Space Controlled by CSWR Bit n 
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n (Bit Position in 
CWSR) 


Program 


| === 20 (lower-half: PORTO~PORT7 if BIG=0, 0000h-7FFFh if BIG=1) 
| == 3 ~~ | _—sVO (upper-half: PORT8-PORTF if BIG=0, 8000h-OFFFFh if BIG=1) 
BIG mode bit 


Figure 5-3 shows a block diagram of the wait-state generator logic for external 
program space. When an external program access is decoded, the appropriate 
field of the PDWSR wait-state register is loaded into the counter. If the field is 
not 000, anot-ready signal is sent to the CPU. The not-ready condition is main- 
tained until the counter decrements to zero and the external READY line is 
high. The external READY and the wait state register READY are OR’d togeth- 
er to generate the CPU WAIT signal. 


Upon reset, all the software wait-state control register. fields are setto 7. CWSR 
is set to OFh. Device reset also sets the BIG bit of the CWSR register to zero. 
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5.4 General-Purpose I/O Pins 


The TMS320C5x devices have two general-purpose pins that are software 
controlled. The BIO pin is a branch control input pin, and the XF pinis an exter- 
nal flag output pin. 


The BIO pin is useful for monitoring peripheral device status—especially as an 
alternative to an interrupt when time-critical loops must not be disturbed. A 
branch can be conditionally executed when the BIO input is active (low). The 
timing diagram, shown in Figure 5—4, is an example of the BIO operation. This 
timing diagram is for a sequence of single-cycle, signal-word instructions lo- 
cated in external memory. The BIO condition is sampled during the decode 
phase of the pipeline for the XC instruction. All other instructions sample the 
BIO pin during the execute phase of the pipeline. 


Figure 5-4. BIO Timing Diagram 


CLKOUT‘1 


® 
O 


Figure 5-5. 


CLKOUT1 


FETCH 


XF 
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XF (external flag) is useful for signalling to external devices via software. The 
XF output pin is set to a high level by the SETC XF (set external flag) instruction 
and reset to a low level by the CLRC XF (reset external flag) instruction. XF 
is set high upon device reset. The relationship between the time SETC/CLRC 
instruction is fetched and the time the XF pin is set or reset is shown in 
Figure 5-5. As with BIO, the timing shown for XF is for a sequence of single- 
cycle, single-word instructions located in external memory. Actual timing may 
vary with different instruction sequences. 


External Flag Timing Diagram 


Net NG Ne Ne 


: (SETC XF)/(CLRC XF) . 
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Table 5-6. _—_ Serial Port Bits, Pins, and Registers (Continued) 


Description 


Registers 


Data Transmit Register 


[BRR [ata Reset Register SSC—*S 
[RSs Sit Regier id 


The serial port uses two memory-mapped registers: the data transmit register 
(DXR) that holds the data to be transmitted by the serial port, and the data re- 
ceive register (DRR) that holds the received data. Both registers operate in ei- 
ther the 8-bit byte mode or the 16-bit word mode and may be accessed in the 
same manner as any other memory-mapped data memory location. Each reg- 
ister has an external clock, a framing synchronization signal, and an asso- 
ciated shift register. Any instruction accessing data memory or memory- 
mapped registers can be used to read from or write to the DXR and DRR. The 
DXR and DRR registers are mapped into data address space. The XSR and 
RSR registers are not directly accessible through software. 


If the serial port is not being used, the DXR and DRR registers can be used 
as general-purpose registers. In this case, FSR should be connected to a logic 
low to prevent a possible receive operation from being initiated. 


The control bits (DLB, FO, TXM, FSM, MCM, XRST, RRST) for the serial port 
reside in the serial control register (SPC). Figure 5-6 shows the serial control 
register bit positions. These bits can be set, cleared, toggled, or loaded via the 
PLU instructions. 


Figure 5-6. Serial Port Control Register (SPC) 

15-1413 12 11088 

IRES| RSRFULL | XSREMPTY | XRDY | RRDY | INt |INo| 

R R R R RR RR 
Note: R= Read 
W = Write 


Table 5-7. Serial Port Control Register Bits Summary 


| O| Reserved Always read as zero. 


The Digital Loopback Mode Bit can be used to put the serial portin digital loopback mode. When DLB=1, 
DLB 


DR and FSR are connected to DX and FSX, respectively, through multiplexers, as shown in 
5-16 Peripherals 


7 6 5 4 3 2 1 0 | 
Teast [east [oo [cu] row [ro [ous] Res 


RW R/W RW RW RW RWW RW OR 


Figure 5—7(a) and Figure 5-7(b). Additionally, CLKR is driven by CLKX if MCM=1. If DLB=1 and 
MCM=0, CLKR is taken from the CLKR pin of the device. This configuration allows CLKX and CLKR 
to be tied together externally and supplied by a common external clock source. The logic diagram for 
CLKR is shown in Figure 5—7(c). If DLB=0, DR, FSR, and CLKR are taken from the respective device 
pins. Note that TXM must be set to one for proper operation in DLB mode. Note also that the FSX and 
DX signals appear on the device pins when DLB=1, but FSR and DR do not. 


Serial . Port 


Figure 5-7. Receiver Signal MUXes 


(a) (b) 


DR FSR 
DR (internal) FSR (internal) 
DX FSX 
DLB (c) DLB 
CLKR 
CLKR (internal) 
CLKX 


DLB 
MCM 


The value of the SPC, upon device reset, is OyOOh where the 2 MSBs of y are 
10 (binary) and the two LSBs of y reflect the current levels on the CLKX and 
CLKR pins. 


5.5.1 Transmit and Receive Operations 


The transmit and receive sections of the serial port are implemented separate- 
ly to allow independent transmit and receive operations. Externally, the serial 
port interface is implemented via the six serial port pins. Figure 5-8 shows the 
registers and pins used in transmit and receive operations. | 
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Figure 5-9. Serial Port Transmit Timing Diagram (FSM=1, first byte = 62h) 
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Figure 5-10. Serial Port Receive Timing Diagram (FSM=1, first byte=62h) 
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lf DXR is reloaded before the old DXR contents have been transferred to XSR, 
the old DXR contents will be overwritten. The DXR is copied to the XSR only 
if the XSR is empty and the DXR has been loaded since the last DXR-to-XSR 
transfer. The DXR should be written only when XRDY=1. This condition is 
guaranteed if the DXR write is made in response to a transmit interrupt. 


lf TXM=1 and FSM=1, FSX pulses are generated internally and the FSX pin 
is configured as an output. To sustain a continuous bit stream on the DX trans- 
mitter output, DXR must be loaded every 8 or 16 bits, depending on the value 
of FO. Furthermore, the next word to be transmitted must be loaded in DXR 
at least 2 CLKX cycles prior to completion of transmission of the current word. 
If this condition is not satisfied, the transmitter will send the previous data from 
the register. 


lf TXM=0, the FSX pin is configured as an input. The transmitter behaves in 
the same way as when TXM=1, except that FSX pulses are supplied externally. 
Aconsequence of this is that the timing requirement on loading DXR for contin- 
uous-mode transmission is relaxed, because the processor does not impose 
a latency between DXR write and FSX active in this case. 


The transmitter’s operation with frame synchronization pulses has been de- 
scribed above. Both continuous operation and burst-mode operation (opera- 
tion with periods of transmitter inactivity) are possible when FSM=1. When 
FSM=0, only continuous-mode transmission is possible. Timing diagrams for 
transmit and receive operations in this mode are shown in Figure 5—11 and 
Figure 5-12. 
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contents of DXR when FSX goes high. If TXM=1 and DXR are written more 
than once during transmission of a given word, only the last word written to 
DXR will be transmitted; any previous values will be overwritten. Therefore, too 
many writes to the DXR during a given interval will not disturb the XSR con- 
tents, but an external FSX pulse will. 


The receive operation is similar to the transmit operation. The receive timing 
diagram with FSM=1 is shown in Figure 5-10. Reception is initiated by a frame 
synchronization pulse on the FSR pin. After FSR goes low, data on the DR pin 
is clocked into the RSR register on every negative-going edge of CLKR. The 
first data bit is considered the MSB, and RSR is filled accordingly. After all the 
bits have been received (as specified by FO), an internal receive interrupt 
(RINT) is generated on the falling edge of CLKR, while the contents of RSR 
are transferred to DRR. If, during a receive operation, anew FSR pulse comes 
in, the bit counter is reset and the RSR starts over. The bits already received 
are lost. 
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5.6.2 TDM Port Operation 


Figure 5~14(a) shows the TMS320C5x TDM port architecture. Up to eight de- 
vices can be placed on the four-wire serial bus. The four-wire bus consists of 
a conventional three-wire bus (TDAT, TFRM, and TCLk) and an additional line 
(TADD) to carry device-addressing information. Data is transmitted and re- 
ceived on the bidirectional TDAT line. Note that the device TDX and TDR pins 
are tied together externally to form the TDAT line. A framing pulse is supplied 
by one of the devices on the bus on the TFRM line. 


Figure 5-14. TDM Four-Wire Bus 
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This device is identified by setting the TXM bit of its TSPC register to one. Only 
one device should have TXM=1 at any given time. Typically, this processor is 
the same one that supplies the TDM port clock signal on TCLK. The TCLKX 
and TCLKR pins are tied together externally to form the TCLK line. TCLKR is 
always an input. TCLKX is an input if MCM=0 and an output if MCM=1. In the 
latter case, one device (the one whose MCM bit=1) can supply the clock (fre- 
quency=one-fourth of CLKOUT1 frequency) for all devices on the bus. The 
clock can be supplied by an external source if MCM=0 for all devices. No more 
than one device should have MCM=_1 at any given time. The specification of 
which processor is to supply clock and framing signals is typically made only 
once, during system initialization. The TADD line carries the transmit address 
byte sent by the transmitting device. Figure 5—14(b) shows how the four-wire 
bus is formed from the six serial port pins. 
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The TDM received address (TRAD) register holds the last value received from 
the TADD line. This register can be used to verify the integrity of the serial inter- 
face and/or to extract partial or complete information as to which device in the 
system transmitted the last data word. For example, if there is a unique trans- 
mit address for each channel, the transmitter can be uniquely identified. Bits 
0-7 hold the received transmit address. Bits 10-8 hold the last time slot num- 
ber (i.e., channel ID number). Bits 13-11 hold the current time slot number. 
This number is simply the last slot number plus one, modulo 8. 


Figure 5-16 shows the timing for TDM port transfers. Near the end of a frame 
(8 time slots), the single device having TXM=1 outputs apulse one TCLK cycle 
wide on the TFRM line. TFRM pulses occur only once every 128 TCLK cycles. 
TFRM is driven low during the remainder of the frame. 


Figure 5-16. Serial Port Timing in TDM Mode 
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After the LSB of a given data word is transmitted, the TDAT line goes into the 
high-impedance state. TDAT comes out of high impedance shortly after the 
next falling edge of TCLK. The next 15 data bits are transmitted on rising edges 
of TCLK. In Figure 5-16, the data bits are shown with subscripts that indicate 
the channel (i.e., time slot) number. 


The timing for TADD signal for channel 0 is shown in Figure 5—16. After the LSB 
of the channel 7 data is transmitted, the TADD line goes into the high-impe- 
dance state. TDAT comes out of high impedance shortly after the next falling 
edge of TCLK. The 8 address bits associated with channel 0 are then trans- 
mitted on TADD. After these have been transmitted, TADD goes high and re- 
mains high until the channel 1 transfer begins. For channels 1-7, TADD carries 
the address during the first 8 clocks and is high during the remaining 8 clocks. 
Note that the short interval between completion of transmission of the channel 
7 LSB andinitiation of transmission of the channel 0 MSBis the only time during 
which TADD is in the high-impedance state. Note that the address line TADD 
must be pulled down to Vs if there are any channels available with no proces- 
sor transmitting data. This is due to the fact that the address line could float 
high. This indicates that when no one is transmitting, all devices will receive 
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5.7 Timer 


The timer is an on-chip down counter that can be used to periodically generate 
CPU interrupts. The timer is decremented by one at every CLKOUT1 cycle. 
Atimer interrupt (TINT) is generated each time the counter decrements to zero. 
The timer thus provides a convenient means of performing periodic I/O or other 
functions. Figure 5-17 shows a logical block diagram of the timer. When the 
timer is stopped, (TSS = 1), the internal clocks to the timer are shut off, allowing 
the device to run in a lower power mode of operation. 


Figure 5-17. Timer Block Diagram 
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The timer interrupt rate is given by 


Ne ag RY ae xe TODAS MPR Sa) 


where tec) is the period of CLKOUT1, uis the sum of the TDDR contents (see 
Table 5-10) plus 1, and v is the sum of the PRD contents (see Figure 5-17) 
plus 1. 


Thus, the timer interrupt rate is equal to the CLKOUT1 frequency divided by 
two independent factors. Referring to Figure 5-17, each of the two divisors is 
implemented with a down counter and period register. The counter and period 
registers for the first stage are the PSC and TDDR fields of the TCR, respec- 
tively, and each is 4 bits wide. The counter and period registers for the second 
stage are the memory-mapped, 16-bit wide TIM and PRD registers. Each time 
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being made, it may be more accurate to stop the timer to read these two values. 
The timer can be stopped by setting the TSS bit to one and restarted by reset- 
ting this bit to zero. 


The timer provides a convenient and efficient way to generate a sample clock 
for an analog interface. Consider the following example of using the timer to 
generate a sample rate of 50 kHz. The initialization for this example is as fol- 
lows: 


*  Clkin frequency = 20 MHz; timer is running at 10 MHz. 
* 


LDP #0 

SPLK #199,PRD Load timer period for 20 usec period. 
OPL #8, IMR ;Set timer interrupt mask bit 

SPLK #20h, TCR ;reload and start timer. 

OPL #1000h,IFR ;Clear any pending timer interrupts. 
CLRC INTM ;global interrupt enable. 


* 


Consider an A/D that is operating at this sample rate. A typical interrupt service 
routine (ISR) would be as follows: 


x 50 kHz sample rate A/D interrupt service routine 
* 


TIMER_ISR MAR *, AR3;Use auxiliary register reserved for Timer ISR. 
IN *, 14 ;Read A/D. 
RETE 7Re-enable interrupts and return. 
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machine cycle to perform a read or a write. The dual-access RAM can be read 
from and written to inthe same cycle. The 1056 words of dual-access RAM are 
configured in three blocks: block 0 (BO) is 512 words at address 0100h—02FFh 
in local data memory or OFEQOh-OFFFFh in program space; block 1 (B1) is 512 
words at address 0300h—04FFh in local data memory; and block 2 (B2) is 32 
words at address 060h in local data memory. 


The TMS320C51 removes the 2K boot ROM from the device’s program 
memory space along with 8K words of single-access program/data RAM. In- 
stead, the device replaces the 8K words of RAM with an 8K-word block of 
maskable ROM. The ROM is located in the address range Oh—1FFFh in pro- 
gram space. The additional 1K words of single-access RAM are mapped to 
data space (800h-OBFFh), program space (2000h—23FFh), or both spaces. 
The dual-access blocks of RAM onthe TMS320C51 are mapped at the same 
addresses as the TMS320C50. The TMS320C50 and TMS320C51 memory 
maps are shown in Figures 6—1(a) and 6—1(b). 


The major topics in this section are listed below: 
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Figure 6-1. _7MS320C51 and TMS320C50 Memory Maps (Concluded) 
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OPL #010h,PMST ;Map TMS320C50 9K RAM or TMS320C51 1K RAM 
jin program space. , 
SETC CNF 7Map BO to program space. 


Table 6-1 shows the possible program memory configurations available onthe 
TMS320C50 device. Table 6-2 shows the possible program memory configu- 
rations for the TMS320C51 device. Note that all addresses are specified in 
hexadecimal. 
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Table 6-1. 1MS320C50 Program Memory Configuration Control 
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6.1.2 Program Memory Address Map 


The reset, interrupt, and trap vectors are addressed in program space. These 
vectors are soft—meaning that the processor, when taking the trap, will load 
the PC with the trap address and execute code at the vector location. Two 
words are reserved at each vector location for a branch instruction to the appro- 
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6.1.3. Program Memory Addressing 
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The program memory space contains the code for applications. Itcan also hold 
table information and immediate operands. The program memory is accessed 
only by the PAB. The address for this bus is generated by the program counter 


~ (PC) when instructions and long immediate operands are addressed. It can 


also be loaded with a long immediate, low accumulator, or registered address- 
es for block transfers, multiply/accumulates, and table read/writes. 


The TMS320C5x devices address code by puiting the PC out on the PAB bus 
and reading the appropriate location in memory. While the read is executing, 
the PC is incremented for the next fetch. If there is a program address disconti- 
nuity (for example, branch, call, return, interrupt, or block repeat), the appropri- 
ate address is loaded into the PC. The PC is also loaded when operands are 
fetched from program memory. Operands are fetched from program memory 
when the device reads or writes to tables (TBLR and TBLW), when it transfers 
data to/from data space (BLPD and BLDP), or when it uses the program bus 
to fetch a second multiplicand (MAC, MACD, MADS, and MADD). The PC is 
loaded with a value other than PC + 1 in the following ways: 


Long immediate address with branch or call instructions. 

Long immediate address with MAC, MACD, BLDP or BLPD instructions. 
Low accumulator with BACC or CALA instructions. 

Low accumulator with TBLR or TBLW instruction. 

BMAR with MADS, MADD, BLDP or BLPD instructions. 

CALU with an interrupt vector address (INTR, TRAP, or NMI) instruction. 
CALU with PASR when at the end of a block repeat loop. 

Pop top of stack with a return instruction. 


OOOoooooo 


The address flow of a program can be traced externally through the address 
visibility feature. This feature can debug during program development; it is en- 
abled after reset and disabled/re-enabled by setting/clearing the AVIS bitin the 
PMST register. The address visibility mode puts the program address out to 
the address pins of the device even when on-chip program memory is ad- 
dressed. Note that the memory control signals (PS, RD, etc.) are not active in 
address visibility mode. Instruction addresses can be externally clocked with 
the falling edge of the instruction acquisition (IAQ) pin. Instruction addresses 
include both words of a two-word instruction but do not include block transfers, 
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Write Enable 

Interrupt Acknowledge 

Memory Ready to Complete Cycle 
Request for Control of Memory Interface 
Acknowledge HOLD Request 

Bus Request 


Acknowledge Bus Request (when HOLDA is low) 


An example of a minimal external program memory interface is shown in 
Figure 6-2. In this figure, the TMS320C5x device interfaces to an 8K x 8 
EPROM. This is a useful iriterface when boot-loading code. The boot loader 
canconcatenate the bytes to form the 16-bit word instructions. The use of 8-bit- 
wide memories saves power, board space, and cost over 16-bit wide memory 
banks. The 16-bit wide memory banks can be used with the same basic inter- 
face as the 8-bit wide memories. . 
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low and a half cycle after WE goes high; this prevents buffer conflicts on the 
external buses. Additional write cycles can be obtained by modifying the soft- 
ware wait-state generator registers. Subsection 6.2.4 includes an example of 
interfacing to external RAM. 
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Table 6-4. TMS320C50 Local Data Memory Configuration Control 
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Table 6-5. TMS320C51 Local Data Memory Configuration Control 
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6.2.2 Local Data Memory Address Map 


The 64K words of local data memory space include the memory-mapped regis- 
ters for the device. The memory-mapped registers reside in data page 0. Data 
page 0 has five sections of register banks: core CPU registers, peripheral reg- 
isters, test/emulation reserved area, I/O space port hole, and scratch-pad 
RAM. 


Ci There are 28 core CPU registers. These registers can be accessed with 
zero wait states. Some of these registers can be accessed through paths 
other than the data bus (i.e., auxiliary registers can be loaded by the 
ARAU). : 


C4 ~The peripheral registers are the control and data registers used in the pe- 
ripheral circuits. These registers reside on a dedicated peripheral bus 
structure called the TIBUS. They require one wait state when accessed. 


Ch =6The test/emulation reserved area is used by the test and emulation sys- 
tems for special information transfers. Writing to this area can cause the 
device to change its operational mode and, therefore, affect the oper- 
ation of the application. 


jk The I/O space port hole provides addressability to 16 words of I/O space 
within the data address space. This allows access to I/O space (other than 
IN and OUT instructions) via the more extensive addressing modes avail- 
able within the data space. For example, the SACL instruction can write 
to an I/O memory-mapped port like an OUT instruction does. The external 
interface looks like an OUT instruction occurs (IS active). Port addresses 
reside off-chip and are subject to external wait states. 


Ly The scratch-pad RAM block (B2) includes 32 words of dual-access RAM 
for variable storage without fragmenting the larger RAM blocks, both on 
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Local Data Memory 
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Table 6-6. Data Page 0 Address Map (Concluded) 


Description 
| Dec | Hex 
| mM | 6 || Timer Register 
Pp prD | a7 | | Pericd Register 
| Tor | 88 | 26 | Timer ControlRegister 
as ee ee 
Ppowsr | 40 | 28 | Program/DataSWWWaltState Register 
Frowsr | __41__ | 29 | VOPonSWWaltStatoRegister CS 
CWSR 2A 
TRCV 


PP 
o 


TDXR 3 
TSPC 
TCS 


I/O Port 80 
/O Port 81 
/O Port 82 
I/O Port 83 
/O Port 84 
VO Port 85 
I/O Port 86 
/O Port 87 
I/O Port 88 
V/O Port 89 
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6.2.2.1 Auxiliary Register (ARO-AR7) 


The eight 16-bit auxiliary registers (ARO—-AR7) can be accessed by the CALU 
and modified by the ARAU or the PLU. The primary function of the auxiliary reg- 
isters is generating 16-bit addresses to data space. However, these registers 
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The block repeat counter register (BRCR) holds the count value for the block 
repeat feature. This value is loaded before a block repeat operation is initiated. 
It can be changed while a block repeat is in progress; however, take caution 
in this case to avoid infinite loops. The program address start register (PASR) 
holds the start address of the block of code to be repeated. The program ad- 
dress end register (PAER) holds the end address of the block of code to be re- 
peated. Both these registers are loaded by the RPTB instruction. Block repeats 
are described in more detail in subsection 3.6.5. 


6.2.2.7 Interrupt Registers (IMR,IFR) 


The interrupt mask register (IMR) is used to individually mask off specific inter- 
rupts at required times. The interrupt flag register (IFR) indicates the current 
status of the interrupts. Interrupts are described in detail in Section 3.8. 


6.2.2.8 Global Memory Allocation Register (GREG) 


The global memory allocation register (GREG) is used to allocate parts of the 
data address space as global memory. This register defines what amount of 
the local data space will be overlayed by global data space. The operation of 
GREG is further discussed in Section 6.3. 


6.2.2.9 Dynamic Bit Manipulation Register (DBMR) 


The dynamic bit manipulation register (DBMR) is used in conjunction with the 
PLU to provide a dynamic (execution time programmable) mask register. The 
use of this register is described in Section 3.7. 


6.2.2.10 Temporary Registers (TREGO, TREG1, TREG2) 


TREGO holds one of the multiplicands of the multiplier. It can also be loaded 
via the CALU with the following instructions: LT, LTA, LTD, LTP, LTS, SQRA, 
SQRS, MAC, MACD, MADS, and MADD. TREG1 holds a dynamic (execution- 
time programmable) shift count for the prescaling shifter. TREG2 holds a dy- 
namic bit address for the BITT instruction. 


6.2.2.11 Processor Mode Status Register (PMST) 


The processor mode status register (PMST) controls memory configurations 
of the TMS320C5x devices (with exception of the CNF bit in ST1). The PMST 
register is described in more detail in subsection 3.6.3 and in the configurability 
sections of Chapter 6. 


6.2.2.12 Serial Port Registers (DRR, DXR, SPC) 


Three registers are used to control and operate the serial port. The serial port 
control register (SPC) contains the mode control and status bits of the serial 
port. The data receive register (DRR) holds the incoming serial data, and the 
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instructions with only one data memory operand and program address bus 

(PAB) on instructions with a second data memory operand. An instruction op- 

erand is provided to the CALU in eight ways, as described in subsection 3.4.2. 

However, data memory addresses are generated in one of the following five 

ways: 

1) By the direct address bus (DAB) using the direct addressing mode (e.g., 
ADD 010h) relative to the data page pointer (DP), 

2) By the direct address bus (DAB) using the memory-mapped addressing 
mode (e.g., LAMM PMST) within data page zero, 

3) By the auxiliary register file bus (AFB) using the indirect addressing mode 
(e.g., ADD *), 

4) By the value pointed at by the PC in long immediate address mode (e.g., 
BLDD TBL1,*+), 

5) Bythe block memory address register (BMAR) in registered block memory 
addressing mode (e.g., BLDD *+). 


In the direct addressing mode, the 9-bit data memory page pointer (DP) points 
to one of 512 pages (1 page=128 words). The data memory address (dma), 
specified by the seven LSBs of the instruction, points to the desired word within 
the page. The address on the DAB is formed by concatenating the 9-bit DP with 
the 7-bit dma. 


Figure 6-3 illustrates the direct addressing mode. In the illustration, the oper- 
and is fetched from data memory space via the data bus, and the address is 
the concatenated value of the DP and the seven LSBs of the instruction. For 
the following example, consider DP = 0184h and TEMP1 = O60h: 


LACC TEMP1 ;ACC = TEMP1. 
In the example, the accumulator is loaded with DATA(C260). 


Figure 6-3. Direct Addressing Mode 
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ADD  010h 
Machine Code = 0010 o000 |jolo01 o000 
, / / 
| / / 
pp = [1100 a110 1 - rs 
‘ 4 a 
DAB = |1100 141110 #41001 #0000 
Operand = Data(DAB) 


Note: DAB is the 16-bit internal address bus for data memory. 


The memory-mapped addressing mode operates much like the direct address- 
ing mode except that the most significant 9 bits of the address are forced to 
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This routine uses indirect addressing to calculate the following equation: 


\ X(I) x Y(TI) 
/ 


The routine assumes that the X values are located in on-chip RAM block BO, 
and the Y values in block Bl. The efficiency of the routine is due to the use 
of indirect addressing and the repeat instruction. 


MAR *,4 ;ARP POINTS TO ADDRESS REGISTER 4. 


SETC CNF ;CONFIGURE BLOCK BO AS PROGRAM MEMORY. 

LAR AR4, #0300h 7;POINT AT BEGINNING OF DATA MEMORY. 

RPTZ #9 ;CLEAR ACC AND P; REPEAT NEXT INST. 10 TIMES 
MAC OFFOOh, *+ s;MULTIPLY AND ACCUMULATE; INCREMENT AR4. 
APAC zACCUMULATE LAST PRODUCT. 

RET 


In the long immediate addressing mode, an operand is addressed by the sec- 
ond word of a two-word instruction. In this case, the program address/data bus 
(PAB) is used for the operand fetch. The prefetch counter (PFC) is pushed onto 
the microcall stack (MCS), and the long immediate value is loaded into the 
PFC. The PAB is then used for the operand fetch or write. At the completion 
of the instruction, the MCS is popped back to the PFC. The PC is incremented 
by two, and execution continues. This technique is used when two memory ad- 
dresses are required for the execution of the instruction. The PFC is used so 
that when the instruction is repeated, the address generated can be autoin- 
cremented. Figure 6-6 illustrates this mode. In this illustration, the source ad- 
dress (OPERAND1) is fetched via PAB, and the destination address (OPER- 
AND2) uses the direct addressing mode. 
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TMS320C5x devices provide a register file containing eight auxiliary registers 
(ARO-AR7). The auxiliary registers may be used for indirect addressing of the 
data memory or for temporary data storage. Indirect auxiliary register address- 
ing (see Figure 6—8) allows placement of the data memory address of an in- 
struction operand into one of the auxiliary registers. These registers are 
pointed to by a three-bit auxiliary register pointer (ARP) that is loaded with a 
value from 0 through 7, designating ARO through AR7, respectively. 


Figure 6-8. Indirect Auxiliary Register Addressing Example 


Auxiliary Register File Data Memory Map 
0000h 
: ARI 
Auxiliary O3FFh 
Register Pointer 0400h 
(in STO) Ape: Loe External 
ARP |0|1/1|—® ARS [0 F F 3 A hi—® _ OFF3Ah 
AR4 103 Bh OFFFEh 
AR5 26Bih 
ARG 
AR7 843 Dh 


The auxiliary registers and the ARP may be loaded from data memory, from 
the accumulator, from the product register, or by an immediate operand de- 
fined in the instruction. The contents of these registers may also be stored in 
data memory or used as inputs to the CALU. These registers appear in the 
memory map as described in Table 6-6 on page 6-15. 


The auxiliary register file (ARO-AR7) is connected to the auxiliary register 
arithmetic unit (ARAU), shown in Figure 6-9. The ARAU may autoindex the 
current auxiliary register while the data memory location is being addressed. 
Indexing either by + 1 or by the contents of the INDX register may be per- 
formed. As a result, accessing tables of information does not require the cen- 
tral arithmetic logic unit (CALU) for address manipulation. The CALU is now 
free to perform other operations. 


If more advanced address manipulation is required, such as multidimensional 
array addressing, the CALU can directly read from or write to the auxiliary reg- 
isters. Take care, however, when writing from the CALU to the auxiliary register 
because the ARAU update of the ARs is done during the decode phase (sec- 
ond cycle) of the pipeline, whereas the CALU write is done during the execution 
phase (fourth cycle) of the pipeline. Therefore, the two instructions directly fol- 
lowing the CALU write should not use the auxiliary register written by the 
CALU. | 
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Local Data Memory 


As shown in Figure 6-9, the index register, the compare register, or the eight 
LSBs of the instruction register can be connected to one of the inputs of the 
ARAU. The other input is fed by the current AR (being pointed to by ARP). 
AR(ARP) refers to the contents of the current AR pointed to by ARP. The ARAU 


performs the following functions: 
AR(ARP) + INDX > AR(ARP) 


AR(ARP) — INDX — AR(ARP) 


AR(ARP) + 1 3 AR(ARP) 
AR(ARP) — 1 > AR(ARP) 
AR(ARP) > AR(ARP) 


AR(ARP) + IR(7-0) — AR(ARP) 


AR(ARP) — IR(7-0) > AR(ARP) 
AR(ARP) + rc(INDX) > AR(ARP) 
AR(ARP) — rc(INDX) > AR(ARP) 


If (AR(ARP) == ARCR), then TC = 1 
If (AR(ARP) < ARCR), then TC = 1 
If(AR(ARP) > ARCR), then TC = 1 
If(AR(ARP) # ARCR), then TC = 1 


if (AR(ARP) = CBER), 
then AR(ARP) =CBSR 


Index the current AR by adding a 
16-bit unsigned integer contained in 
INDX. Example: ADD *0+. 


Index the current AR by subtracting 
a 16-bit unsigned integer contained 
in INDX. Example: ADD *0-. 


Increment the current AR by one. 
Example: ADD *+. 


Decrement the current AR by one. 
Example: ADD *~. 


Do not modify the current AR. Exam- 
ple: ADD *. 

Add an 8-bit immediate value to cur- 
rent AR. Example: ADRK #055h. 


Subtract an 8-bit immediate value 
from current AR. Example: SBRK 
#055h. 


Bit-reversed indexing, add INDX 
with reverse-carry (rc) propagation. 


Example: ADD *BRO+. 


Bit-reversed indexing, subtract 
INDX with reverse-carry (rc) propa- 
gation. Example: ADD *BRO-. 


Compare current AR with 
ARCR and if condition is true, 

then set TC bit of the status 
register (ST1) to one. If false, 

then clear TC. Example: CMPR 3. 


lf at end of circular buffer, reload 
start address. 


The index register (INDX) can be added to or subtracted from AR(ARP) on any 
AR update cycle. This 16-bit register is one of the memory-mapped registers 
and is used to increment or decrement the address in steps larger than one for 
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auxiliary register modification occurring. The ARAU will not detect an AR up- 
date that steps over the value contained in CBER. Note that the test in the 
ARAU is performed before the auxiliary register update. 


6.2.4 External Interfacing to Local Data Memory 


The TMS320C5x devices can address up to 64K words of off-chip local data 
memory. These are the key signals for this interface: 


A0Q-A15 16-Bit Bidirectional Address Bus 
DO-D15 16-Bit Bidirectional Data Bus 
DS Data Memory Select 
STRB External Memory Access Active Strobe 
RD Read Select (External Device Output Enable) 
WE Write Enable 
READY Memory Ready to Complete Cycle 
HOLD Request for Control of Memory Interface 
HOLDA Acknowledge HOLD Request 
BR. Bus Request 
_ TAQ Acknowledge Bus Request (when HOLDA is low) 


An example of an external RAM interface is shown in Figure 6-10. In this fig- 
ure, the TMS320C5x device interfaces to four 16K x 4-bit RAM devices. The 
data memory select (DS) is directly connected to the chip select (CS) of the 
devices. This means the external RAM block will be addressed in any of the 
four 16K banks of local data space. If there are additional banks of off-chip data 
memory, a decode circuit that gates DS with the appropriate address bits can 
be used to drive the memory block chip select. 
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Local Data Memory 
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device. If the RAM device does not have an OE pin, then DS should be gated 


_ with STRB and connected to the CS pin of the RAM to ) implement the same 


function. The WE signal of the TMS320CS5x is tied to the WE signal of the RAM. 
The TMS320C5x takes at least two cycles on all external writes, including a 
half cycle before the WE goes low and ahalf cycle after WE goes high; this pre- 
vents buffer conflicts on the external buses. Additional wait states may be gen- 
erated with the software wait-state generators. 
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6.3.2 Global Memory Addressing 


When adata memory address, either direct or indirect, corresponds to a global 
data memory address (as defined by GREG), BR is asserted low with DS to 
indicate that the processor wishes to make a global memory access. External 
logic then arbitrates for control of the global memory, asserting READY when 
the TMS320C5x device has contro!. The length of the memory cycle is con- 
trolled by the READY signal. In addition, the software wait-state generators 
can be used to extend the access times for slower, external memories. The 
wait-state generators corresponding to the overlapped memory address space 
in local data space will generate the wait states for the corresponding address- 
es in global data memory space. 


6.3.3 External Interfacing of Global Memory 


Global memory can be used in various digital signal processing tasks, such as 
filters or modems, where the algorithm being implemented may be divided into 
sections with a distinct processor dedicated to each section. With multiple pro- 
cessors dedicated to distinct sections of the algorithm, throughput may be in- 
creased via pipelined execution. Figure 6—11 illustrates an example of a global 
memory interface. Since the processors can be synchronized by using the RS 
pin, the arbitration logic may be simplified and the address and data bus trans- 
fers made more efficient. 


Figure 6-11. Global Memory Interface 
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Input/Output Space 


6.4 Input/Output 


Space 


The TMS320C5x devices support an I/O address space of 64K 16-bit parallel 
input and output ports. I/O ports allow access to peripherals typically used in 
DSP applications such as codecs, digital-to-analog (D/A) converters, and ana- 
log-to-digital (A/D) converters. This section discusses addressing !/O ports 
and interfacing I/O ports to external devices. 


6.4.1. Addressing Input/Output Ports 


Access to external parallel I/O ports is multiplexed over the same address and 
data bus for program/data memory accesses. I/O space access is distin- 
guished from program/data memory accesses by the IS signal going active 
low. All 65,536 ports can be accessed via the IN and OUT instructions, as 
shown in the following example: 


IN OFFFEh,DAT7 ;Read data to data memory from external 
device on port 65534. 

OUTOFFFFh,DAT7 ;Write data from data memory to external 
;device on port 65535. 


Sixteen of the 64K I/O ports are mapped in data memory space as shown in 
Table 6—4. The I/O ports may be accessed with the IN and OUT instructions 
along with any instruction that reads or writes a location in data space. In this 
way, I/O is treated the same way as memory. The following example illustrates 
the use of direct addressing to access an I/O device on port 51h: 


SACL 51h ; (DP = 0). Store accumulator to external device 
7on port 81. 


Accesses to memory-mapped I/O space are distinguished from program/data 
accesses by the IS signal. DS is not active, even though the user is writing to 
data space. 


6.4.2 Interfacing to I/O Ports 
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The RD and WE signals can be used along with chip-select logic to output data 
to an external device. The port address can be decoded and used as a chip 
select forthe input or output device. The access times to I/O ports can be modi- 
fied through the CWSR and IOWSR software wait-state registers. The BIG bit 
in the CWSR register determines how the I/O space is mapped to the software 
control registers. If the BIG bit is set to 0 in the CWSR register, the first sixteen 
ports are assigned in pairs to a software wait-state generator. Each following 
set of 16 registers maps accordingly to the first 16 ports when BIG = 0. For ex- 
ample, the 16 ports that correspond to the addresses in the data space port 
hole (ports 50h-5Fh) have the same wait states as ports O-Fh. If the BIG bit 
is set to 1, the wait states are mapped to program space in eight 8K blocks of 
memory. The following table shows how the software wait states are assigned 
to I/O ports according to the BIG bit: 


Memory | 


Direct Memory Access (DMA) 


6.5 Direct Memory Access (DMA) 


The TMS320C5x supports multiprocessing designs using direct memory ac- 
cess (DMA) of external memory or the TMS320C5x on-chip single access 
RAM. The DMA features can be used for multiprocessing by temporarily halt- 
ing the execution of one or more processors to allow another processor to read 
from or write to the TMS320C5x’s local off-chip memory or on-chip single-ac- 
cess RAM. The external memory access may be controlled by using the 
HOLD/HOLDA signals. The DMA access of internal RAM on the TMS320C5x 


is controlled by the HOLD, HOLDA, R/W, STRB, BR, and IAQ lines. 


| Decode Logic 
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Figure 6-12. I/O Port Interfacing Logic 
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The multiprocessing is typically a master-slave configuration. The master may 
initialize a slave by downloading a program into its program memory space 
and/or may provide the slave with the necessary data by using external 
memory to complete a task. In a typical TMS320C5x direct memory access 
scheme, the master may be a general-purpose CPU, another TMS320C5x, or 
even an analog-to-digital converter. A simple TMS320C5x master-slave con- 
figuration is shown in Figure 6—13. 


Ea 


Figure 6-13. Direct Memory Access Using a Master-Slave Configuration 
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Figure 6-14. Direct Memory Access in a PC Environment 


6-38 


Address Address 


Local 
Program/Data 
Memory 


Master CPU 
Address 


Data 


Arbitration 
Logic 


System Bus 


Disk Controller 


TMS320C5x 


Address Ready HOLDA 


Data 


The TMS320C5x device also provides direct access of the on-chip RAM for ex- 
ternal devices. DMA of the on-chip single-access RAM requires the following 
signals: 


HOLD External request for control of address, data, and control lines. 


HOLDA __ Indicates to external circuitry that the memory address, data, and 
control lines are in high impedance, allowing external access of 
on-chip single-access RAM. 


BR Bus request signal. Externally driven low in hold mode to indicate a 
request for access to on-chip single-access RAM. 

IAQ Acknowledge BR request for access to on- chip single-access 
RAM while HOLDA is low. 

R/W Read/write signal indicates the data bus direction for DMA reads 


(high) and DMA writes (low). 


STRB When active low and IAQ and HOLDA are low, this input signal is 
used to select the memory access. STRB determines the duration 
of the memory access. 


A15-AO0 Address inputs during HOLDA and BR active low. 
D1i5-D0 DMAdata. 


In order to access the TMS320C5x device’s on-chip single-access RAM, a 
master device must control the device. The master processor initiates a DMA 
transfer by placing the TMS320C5x device in HOLD. Once the device re- 
sponds with a HOLDA, the master can select access to the internal on-chip 
single-access RAM by lowering the BR input. The device will respond with an 
TAQ to acknowledge access to the on-chip memory. At this time, the processor 
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6.6 Memory Management 


6.6.1 Block Moves 


The TMS320C5x devices have a programmable memory map, which can vary 
for each application. Instructions are provided for integrating the device 
memory into the system memory map. The TMS320C50 device includes 2K 
words of boot ROM, 9K words of single-access RAM, and 1056 words of dual- 
access RAM. The TMS320C51 device includes an 8K program ROM, 1K 
words of single-access RAM, and 1056 words of dual-access RAM. Examples 
of moving and configuring memory are provided in this section. 


The TMS320C5x devices address a large amount of memory but are limited 


inthe amount of on-chip memory. Several instructions are available for moving 
blocks of data from off-chip slower memories to on-chip memory for faster pro- 
gram execution. In addition, data can be transferred from on-chip to off-chip 
for storage or multiprocessor applications. 


The BLDD instruction facilitates the transfer of data from external or internal 
data memory to internal or external data memory. Example 6-1 illustrates the 
use of the BLDD command to move data (for example, a table of coefficients) 
from external memory to internal data RAM. 


Example 6-1. Moving External Data to Internal Data Memory With BLDD 
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+ + + 


This routine uses the BLDD instruction to move external data memory to 
internal data memory. 


MOVED LMMR BMAR, #2800h;BMAR contains source address in data memory. 


LAR AR7,#300h ;AR7 contains destination address in data memory. 
MAR *, ART 7; LARP = AR7. 

RPT #511 7Move 512 values to data memory block Bl. 

BLDD BMAR, *+ 

RET 


For systems with external data memory but no external program memory, the 
BLDP instruction can be used to move additional blocks of code into internal 
program memory. Example 6-2 illustrates the use of the BLDP instruction. 
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Example 6-5. Moving Data Memory to Program Memory With TBLW 


* 

* This routine uses the TBLW instruction to move data memory to 

* program memory. The calling routine must contain the destination program 

* memory address in the accumulator. 

* 

TABLEW MAR *, AR4 7;LARP = AR4. 
LAR AR4, #300h ;AR4 contains source address in data memory. 
RPT #511 Move 512 items from data memory to program 
TBLW *+ zmemory. 
RET ;Accumulator contains address of program RAM. 


The IN and OUT instructions move data from data memory to an external port. 
The use of these instructions is illustrated in the following examples. 


Example 6-6. Moving Data From I/O Space to Data Memory With IN 


* 


* This routine uses the IN instruction to move data from I/O space into 


* data memory. 
* 


INPUT MAR *, AR2 ;LARP = AR2. 
LAR AR2, #300h 7AR2 = 300h. 
;Input value to data memory at 300h 
IN *+,1 7from port 1. 
RET 


Example 6-7. Moving Data From Data Memory to I/O Space With OUT 


+ + 


This routine uses the OUT instruction to move data from data space to 


* I/O space. 
* 
OUTP MAR *,AR1 ;LARP = AR1 
LAR AR1, #200h 7AR1 = 200h 
;Output value to port 1. 
OUT e+, 1 
RET 


6.6.2 On-Chip Boot ROM (TMS320C50) 
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The fifth generation of the Texas Instruments digital signal processors provides 
two different options regarding the chip count and the system flexibility. One 
member of the family, TMS320C51, has 8K words of mask-programmable on- 
chip ROM that allows the customer to use a code-customized processor for 
specific applications while taking advantage of the following: ) 


C4 Greater memory expansion 
Cy ~=Lower system cost 

[i ~=Less hardware and wiring 
Cy Smaller PCB 


User routines may be submitted customers to Texas Instruments to be masked 
to the on-chip ROM of TMS320C51. 
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low-order one. Data is read from the lower eight data lines, ignoring the upper 
byte on the data bus. The destination address and the length of the code are 
specified by the first two 16-bit words read from the source. The length is de- 
fined as: 


length = number of 16-bit words to be transferred — 1 


The code is transferred from the global data memory to the program memory. 
Note that there is at least a four-instruction cycle delay between a read from 
EPROM and write to destination address. This ensures that if the destination 
is external memory, there is enough time to turn off the source memory 
(EPROM) before the write operation is done. 


Memory 
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Processor Initialization 


7.1. Processor Initialization 


Prior to the execution of a digital signal processing algorithm, it is necessary 
to initialize the processor. Generally, initialization takes place anytime the pro- 
cessor is reset. 


The processor is reset by applying a low level to RS input for at least five ma- 
chine cycles; IPTR bits of PMST register are all set to zero, thus mapping the 
vectors to page zero in program memory space. This means that the reset vec- 
tor always resides at program memory location 0. This location normally con- 
tains a branch instruction in order to direct program execution to the system 
initialization routine. A hardware reset clears all pending interrupt flags and 
sets the INTM (global enable interrupts) bit to 1, thereby disabling all interrupts. 
It also initializes various status bits and peripheral registers. Refer to subsec- 
tion 3.8.1 of this book for details. 


To configure the processor after the reset, the following internal functions 
should be initialized. 


Memory-mapped core processor and peripheral control registers 
Interrupt structure (INTM) 

Mode control (OVM, SXM, PM, AVIS, NDX, TRM) 

Memory control (RAM, OVLY, CNF) 


Auxiliary registers and the auxiliary register pointer (ARP) 
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Data memory page pointer (DP) 


The OVM (overflow mode), TC (test/contro! flag), IMR (interrupt mask regis- 
ter), auxiliary register pointer (ARP), auxiliary register pointer buffer (ARB), 
and data memory page pointer (DP) are not initialized by reset. 


Example 7—1 shows coding for initializing the TMS320C5x to the following ma- 
chine state, and for the initialization performed during hardware reset: 


G4 Internal single-access RAM configured as program memory 
Interrupt vector table loaded in internal program memory 
Interrupt vector table pointer (IPTR) 


Internal dual-access RAM blocks filled with zero 
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Interrupts enabled 
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Interrupts 


The TMS320C5x devices have four external maskable user interrupts 
(INT1-INT4) and one nonmaskable interrupt (NMI) available for external de- 
vices. Internal interrupts are generated by the serial ports, the timer, and by the 
software interrupt instructions (INTR, TRAP, and NMI). The interrupt structure 
is described in subsection 5.1.2, /nterrupts. 


The TMS320C5x devices are capable of generating software interrupts using 
INTR instruction. This allows any of the 32 interrupt service routines to be ex- 
ecuted from the user’s software. The first 20 ISRs are reserved for external in- 
terrupts, peripheral interrupts, and future implementations. The other 12 loca- 
tions in the interrupt vector table are user-definable. The INTR instruction can 
invoke any of the 32 interrupts available on the TMS320C5x devices. 


The context saving and restoring function is done in hardware when an inter- 
rupt trap is executed. An 8-deep hardware stack is available for saving return 
addresses of the subroutines and the interrupt service routines. Also, there is 
a one-deep stack (or shadow registers) on the following registers: 


ACC accumulator 

ACCB accumulator buffer 

PREG product register 

STO status register 0 

ST1 status register 1 

PMST processor mode status register 
TREGO temporary register for multiplier 
TREG1 temporary register for shift count 
TREG2 temporary register for bit test 
INDX indirect address index register 
ARCR auxiliary register compare register 


When the interrupt trap is taken, all these registers are pushed onto the one- 
deep stack. These shadow registers are popped when the return-from-inter- 
rupt (RET! or RETE) is executed. Detailed discussion of interrupts are given 
in Section 3.8, /nterrupts. 


The following example illustrates the use of INTR instruction. The foréground 
program sets up auxiliary registers and invokes user-defined interrupt number 
20. Since the context is saved automatically, the interrupt service routine is free 
to use any of the saved registers without destroying the calling program’s vari- 
ables. The routine shown here uses the CRGT instruction to find the maximum 
value of 16 executions of the equation Y=aX*2+bX+c. The X values are 
pointed atby AR1. AR2 and AR3 point to the coefficients and Y results, respec- 
tively. In order to return the result to the calling routine, all the registers are re- 
stored by executing an RET! instruction. The computed value is placed in the 
accumulator, and a standard return is executed because the stack is already 
popped. 
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7.3 Software Stack 


The TMS320C5x has an internal 8-deep hardware stack that is used to save 
and restore return addresses for subroutines and interrupts. See subsection 
3.6.1 for further details. Provisions have been made on the TMS320C5x to ex- 
tend the hardware stack into the data memory. 


The PUSH and POP instructions can access the hardware stack via the accu- 
mulator. Two additional instructions, PSHD and POPD, are included in the in- 
struction set so that the stack may be directly stored to and recovered from the 
data memory. 


A software stack can be implemented by using POPD instruction at the begin- 
ning of each subroutine in order to save the PC in data memory. Then, before 
returning, aPSHD is used to put the proper value back onto the top of the stack. 


‘When the stack has seven values stored on it, and two or more values are to 
be put on the stack before any other values are popped off, a subroutine that 
expands the stack is needed, such as the one shown in Example 7-3. In this 
example, the main program stores the stack, starting location in memory in 
AR2 and indicates to the subroutine whether to push the data from memory 
onto the stack or pop data from the stack to memory. If a zero is loaded into 
the accumulator before calling the subroutine, the subroutine pushes data from 
memory to the stack. If the accumulator contains a nonzero value, the subrou- 
tine pops data from the stack to memory. 


Since the CALL instruction uses the stack to save the program counter, the 
subroutine pops this value into the accumulator and utilizes the BACC instruc- 
tion to return to the main program. This prevents the program counter from be- 
ing stored into a memory location. The subroutine in Example 7-3 uses the 
BCNDD (delayed conditional branch) instruction to determine whether a save 
or restore operation is to be performed. 


Example 7-3. Software Stack Operation 


Cr A A A A A 2 2 2 2 A 2 
7 This routine expands the stack while letting the 

7; Main program determine where to store the stack 

7; contents, or from where to restore them. 

; Entry Conditions: 

7 ACC = 0 (restore stack); 1 (save stack) 

7; AR2 -> Top of software stack in data memory 
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STACK: BCNDD POP,NEQ ;Delayed branch if POPD required 


MAR *, AR2 ;Use AR2 as stack pointer 

POP 7Get return address 

RPT #6 ;repeat 7 times 

PSHD x4 7Put memory in stack 

BACC : 7Return to main program 
POP: MAR km 7Align AR2 

RPT #6 ;Repeat 7 times 

POPD KS ;Put stack in memory 

MAR K+ 7;Realign stack pointer 

BACC 7Return to main program 
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APL x 
SFR 


LOOP RET 


Example 7-5. Using PLU to Do Packing 


etitle ‘Routine to pack input bits in a single word’ 


‘;Keep the LSB only 
*Shift right to throw away unpacked bit 


;Return back 


eereeereees eee ee eeeeee 


* PCKD 
Fe ee ate ow am ee ee ce Oe ta os oe es 
* [Bn ------ BO| 
Fo com ee om ee ce one cs mt ee ee 
* 
* UNPCKD 
Fo ee oe ae oe ee oe oe eo oe 
* 10 -- O |Bn| 
Fo ate ee om ae ne ee ee oe oe ow a 
* 10 -- OJBn-1| 
FO re ee me ee ee ew on 
* 
FO ew ee oe we ee ww ee 
bal |0 -- 0|BO| 
Ko we ew we ee ee ee ee 
hal a 2020 
-mmregs 
.data 
NO_BITS .set 16 sNumber of bits to be packed 
PCKD -set 60h 7Packed word 
UNPCKD .set 61h ;Array of unpacked bits 
-text 
PACK LAR ARO, #$UNPCKD ;ARO points to start of UNPACKED array 
MAR *, ARO ;ARP <- ARO 
LDP #0 ;DP=0 
SPLK #NO_BITS-2,BRCR ;Loop NO_BITS-1 times 
LACC e+ 7;Get the MSB 
RPTB LOOP-1 7Begin looping 
SFL ;Make space for next bit 
. ADD a+ 7;Put next bit 
NOP 
LOOP 
SACL PCKD ;Store the result 
RET 7;Return back 


7.4.2 Multiconditional Branch Instruction 


The TMS320C5x allows multiple conditions to be tested before passing control 
to another section of program. Any of the following 13 conditions may be tested 
individually or in combination with others by CC, RETC, XC, and BCND instruc- 


tions: 


ACC=0 
ACC=0 
ACC<0 
ACCs0 
ACC>0 
ACC20 


GEQ 
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This routine searches through a block of data in the data memory 
to store the maximum value and the address of that value in memory 
locations MAXVAL and MAXADR, respectively. The data block could be 
of any size defined by the Block Repeat Counter Register (BRCR). 


KEY C5X instructions: 


RPTB repeat a block of code as defined by repeat counter BRCR 
CRGT compare ACC to ACCB. Store larger value in both ACC, ACCB. 

Set CARRY bit if a value larger than the previously larger one is found 
XC execute conditionally (1 or 2 words) if flag (Carry) is set. 


MAXADR .set 60h 
MAXVAL .set 61h 


-mmregs 
text 
LDP #0 # point to data page 0 
LAR ARO, #0300h ; AR= data memory addr 
SETC SXM 7 set sign extension mode 
LACC #08000h ; load minimum value 
siz7 Use #07FFFh (largest possible) to check for minimum value 
SACB 3 into ACCB 
SPLK #9,BRCR 3; vpt cont = 9 for 10 data values 
RPTB endb -1 7 repeat block from here to endb-1 
startb 
LACC * load data from <(ARO)> into ACC 


° 
a 


me Ne 


CRGT ; set carry if ACC > previous largest 
Use CRLT to find minimum value 
SACL MAXVAL save new largest which is in ACC & ACCB 


XC #1,C # save addr if current value > previous largest 
SAR ARO, MAXADR: ; a 
MAR *+ 

endb RET 
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At the end of routine, following 
registers contain: 


ACC = 32050 

ACCB = 32050 

(MAXVAL) = 32050 

(MAXADR) = 0307h 

-data 7; data is expected to be in data ram 
-word 5000 3 start address = 0300h 

-word 10000 

-word 320 

word 3200 


word -5600 
word -2105 


eword 2100 
-word 32050 
-word 1000 
word -1 
-end 


7.4.4 Matrix Multiplication Using Nested Loops 
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The TMS320C5x provides three different types of instructions to implement 
code loops. The RPT (single-instruction repeat) instruction allows the following 
instruction to be executed N times. The RPTB (repeat block) instruction repeat- 
edly executes a block of instructions with the loop count determined by the 
BRCR count register. The BANZ (branch if AR not zero) instruction is another 
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7.5 Circular Buffers 
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Circular addressing is an important feature of the TMS320C5x instruction set. 
Algorithms like convolution, correlation, and FIR filters can make use of circular 


. buffers in memory. The TMS320C5x supports two concurrent buffers operat- 


ing via the auxiliary registers. These five memory-mapped registers control the 
circular buffer operation: CBSR1, CBSR2, CBER1, CBER2, CBCR. See sub- 
section 4.1.6 of this book for details. 


The start and end addresses must be loaded in the corresponding buffer regis- 


~ ters before the circular buffer is enabled. Also, the auxiliary register that acts 


as a pointer to the buffer must be initialized with the proper value. 


Example 7-9 illustrates the use of a circular buffer to generate a digital sine 
wave. A 256-word sine-wave table is loaded in the B1 block of dual-access in- 
ternal data memory from external program memory. Accessing the internal 
dual-access memory requires only one machine cycle. The block move ad- 
dress register (BMAR) is loaded with the ROM address of the table. The 
block-move instruction moves 256 samples of sine wave to internal data 
memory, which is then set up as a circular buffer. . 


The start and end addresses of this circular buffer are loaded into the corre- 
sponding registers. The auxiliary register AR7 is also initialized to the begin- 
ning of the sine-wave table. Note the use of SAMM instruction to update AR7. 
This is possible because all auxiliary registers are memory-mapped at page 
0. Finally, the circular buffer #1 is enabled, and AR7 is mapped to that buffer. 
The other circular buffer is disabled. 


Whenever the next sample is to be pulled off from the table, postincrement indi- 
rect addressing may be used with AR7 as the pointer. This ensures that the 
pointer will wrap around to the beginning of the table if the previous sample was 
the last one on the table. 
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Circular Buffers 


eRe 
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The following code does modulo-256 addressing: 


START 


.set 


LDP 
LACL 
SAMM 


04000h 


#0 
#0FFh 
DBMR 


*O+ 
AR7 
#START, ART 


° 


a 


. 
’ 


start address of the buffer 


max value = 255 


increment AR7 by some amount 
extract lower 4 bits 
add the start address 


Software Applications 


Single- “Instruction Repeat Loops 


Example 7-10. Memory-to-Memory Block Moves Using RPT 


ee ee :7 
PRPERR EEE E OPER T TORE ER ETRE ETE R ETE TERETE ETE 


; This routine uses the BLDD instruction t 
7; data memory to internal data memory. 
; 


O = 


we 
=e 
~ 
=e 
=e 
=e 
we 
we 
we 
we 
we 
=e 
=e 
we 
me 
we 
we 
we 
we 
™e 
we 
we 
we 
we 
we 
=e 
we 
ve 
we 
=e 
=e 
=e 
ze 
me 
=e 
=e 
=e 
=e 
=e 
=e 
=e 
we 
=e 
7 
=e 
=e 
ve 
=e 
se 
me 
=e 
=e 
™e 
=e 
=e 
=e 
ze 


SPLK #4000h, BMAR 
LAR AR7,#100h 


BMAR -> source in data memory. 
AR7 -> destination in data memory 


=e te Ne Ne 


MAR *,AR7 LARP = AR7. 

RPT #1023 Move 1024 value to blocks BO and Bl 
BLDD BMAR, *+ 

RET 

Paar a ee 0 O20 2000220000002 
This routine uses the BLDP instruction to move external 


data memory to internal program memory. This 


7 instruction could be used to boot load a program to 
+ the 8K on chip program memory from external data memory. 
Pe a ae oe 0022220000202 200222222 2 22 
MOVEDP : 
SPLK #800H,BMAR ; BMAR -> destination in program memory 
LAR AR7,#0EQO0O0h ; AR7 -> source in data memory. 
RPT #8191 ; Move 8k to program memory space. 
BLDP *+ 
RET 
Pa a 2 22 22002 2 20 2 2022 
This routine uses the BLPD instruction to move external 


program memory to internal data memory. This routine 
is useful for loading a coefficient table stored in 
external program memory to data memory when no external 
data memory is available 
‘ 


coeerereceo eee ee eee eee eee eee 
rarer rrrrrerrerverereeee 


we Te Me Me Me Ne 


=e 


aera eee ee 
MOVEPD: : 
LAR AR7,#100h AR7 -> destination in data memory. 


; 
RPT #127 + Move 128 values from external program 
BLPD #3800h, *+ 7; to internal data memory BO. 


RET 
PERRET ROR ERT R RETR TR RETR R TTT RRR TREE RTE EER ERE REE EETG 
This routine uses the TBLR instruction to move program 


memory to data memory space. This differs from the BLPD 
instruction in that the accumulator contains the address 
in program memory from which to transfer.. This allows 
for a calculated, rather than pre-determined, location in 
program memory to be specified. 


eecereoerr eee eee eee ee eee ase ee eeoceereoes eee eee ee eere sree er eeee 
PEEP REE PERE EET ERT RETR EER TEETER EER TR EEE RTE CE EEE TREE EEE 


Me Ne Ne Ne Be Me Se Ne 


TABLER: 
MAR *,AR3 3; AR3 -> destination in data memory. 
LAR AR3,#300h 
RPT #127 + Move 128 items to data memory block Bl 
TBLR *+ 
RET 


see 

This routine uses the TBLW instruction gag move gia 
memory to program memory. The calling routine must 
contain the destination program memory address in the 
accumulator. 

ta 


a 
te ee A Ae 0 0 0 0 2 2 0 200 200 2 0 


we Me Ne Me Me Ne 


™e 


TABLEW: 
MAR *,AR4 
LAR AR4, #380h 


ARP = AR4. 
AR4 -> source address in data memory. 


=e Ne 
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7.f Subroutines 


Example 7-11 illustrates the use of a subroutine to determine the square root 
of a 16-bit number. The main routine executes to the point where the square 
root of anumber should be taken. At this point, a delayed call (CALLD) is made 
to the subroutine, transferring control to that section of the program memory 
for execution and then returning to the calling routine via the delayed return 
(RETD) instruction when execution has completed. 


This example shows several features of TMS320CS5x instruction set. In particu- 
lar, note the use of delayed-call (CALLD), delayed-return (RETD), and condi- 
tional-execute (XC) instructions. Due to the four-level-deep pipeline on 
TMS320C5x devices, normal branch instructions require 4 cycles to execute. 
Using delayed branches, only two cycles are required for execution. The XC 
instruction is useful where only one or two instructions are to be executed con- 
ditionally. In this example, notice how XC is used to avoid extra cycles due to 
branch instruction. Use of the XC instruction also helps in keeping the execu- 
tion time of a routine constant, regardless of input conditions. This is because 
XC executes NOPs in place of instructions if conditions are not met. 


Example 7-11. Square Root Computation Using XC 
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s; Autocorrelation 


eoeoeoerecece eee eee sree eree eee er seer een essere ere reece seer eee eee eee se eres eee eese 


i a A 2 2 2 A A 2 2 2 2 2 2 2 A 
3 This routine performs a correlation of two vectors and then 
# calls a Square Root subroutine that will determine the RMS 

3 amplitude of the wave form. 


esesee et ecee sec ece ese ese ese eee eee ese eee ese sees esreer eee ee eeee eee eee ee 


Cr A A a 2 i 2 2 2 A i 2 2 2 i 2 A 


AUTOC 


CALLD SQRT 7Call square root subroutine after 


MAR *, ARO # executing next two instructions 


LACC * ;Get the value to be passed to SQRT 
: 7 subroutine 


eer ecec weer eee ees ee eer eee eee eeee eee eset eereseeeseeereereer eee eoeesreeeeeene 


A A A A A 2 2 2 A 2 2 A 2 2 A A 2 A 2 2 2 2 2 2 2 2 2 2 2 2 2 A 


+ Square Root Computation 


+ This routine computes the square root of a number that is located 
7 in the lower half of accumulator. The number is in Q15 format. 


eoeeere reer seer eee eee eee eee er ee eee reser eres ese eeeresrereseereaeeeeeeeeeeeree 


BRCR -set 09h ;DP=0 
STO eset 60h ;Internal RAM block B2 
STl -set 61h 
NUMBER .set 62h 
TEMPR .set 63h 
GUESS eset 64h 
etext 
SQRT SST #0,STO 
sst #1,ST1 ;Save context 
LDP #0 
SETC SXM 7Set SXM=1 
SPM 1 7;Set PM mode for fractional arithmetic 
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7.8 Extended-Precision Arithmetic 


Numerical analysis, floating-point computations, or other operations may re- 
quire arithmetic to be executed with more than 32 bits of precision. Since the 
TMS320C5x devices are 16/32-bit fixed-point processors, software is required 
for the extended precision of arithmetic operations. Subroutines that perform 
the extended-arithmetic functions for TMS320C5x are provided in the exam- 
ples of this section. The technique consists of performing the arithmetic by 
parts, similar to the way in which longhand arithmetic is done. 


The TMS320C5x has several features that help make extended-precision cal-. 
culations more efficient. One of the features is the carry bit. This bit is affected 
by all arithmetic operations of the accumulator, including addition and subtrac- 
tion with the accumulator buffer. This allows 32-bit-long arithmetic operations 
using the accumulator buffer as the second operand. 


The carry bit is also affected by the rotate and shift accumulator instructions. 
It may also be explicitly modified by the load status register ST1 and the set/re- 
set control bit instructions. For proper operation, the overflow mode bit should 
be reset (OVM = 0) so that the accumulator results will not be loaded with the 
saturation value. 


7.8.1 Addition and Subtraction 
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The carry bit is set whenever the addition of a value from the input scaling shift- 
er, the P register, or the accumulator buffer to the accumulator contents gener- 
ates a carry out of bit 31. Otherwise, the carry bit is reset because the carry 
outof bit31 is azero. One exception to this case is the addition to the accumula- 
tor with a shift of 16 instruction (ADD mem,16), which can only set the carry 
bit. This allows the ALU to generate a proper single carry when the addition 
either to the lower or the upper half of the accumulator actually causes the 
carry. The following examples help to demonstrate the significance of the carry 


_ bit of the TMS320C5x for additions: 
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In a similar way to addition, the carry bit on the TMS320C5x is reset whenever 
the input scaling shifter, the P register, or the accumulator buffer value sub- 
tracted from the accumulator contents generates a borrow into bit 31. Other- 
wise, the carry bit is set because no borrow into bit 31 is required. One excep- 
tion to this case is the SUB mem, 16 instruction, which can only reset the carry 
bit. This allows the generation of the proper single carry when the subtraction 
from either the lower or the upper half of the accumulator actually causes the 
borrow. The examples in Figure 7-2 demonstrate the significance of the carry 
bit for subtraction. 


Figure 7-2. 32-Bit Subtraction 


C MSB LSB C MSB LSB 

x 00000000 acc X 00000000 acc 
ee ae eee ee Ee 

O PF PPE EF EP 0 00000001 

Cc MSB LSB C MSB LSB 

X 7FFPFPEFEF F Acc X 7FFFFFEF F ACC 
Re aa a a ———EEFEFFEFEE 

1 7TFFFFFFE Cc 80000000 

C MSB LSB C MSB LSB 

Xx 80000000 acc Xx 8000000 0 Acc 
ee ao SE EE Pe Ee 

1 7 ESE EOE EB EF 0 80000001 

Cc MSB LSB C MSB LSB 

0 00000000 acc O FFFFFEF F ACC 
eee ee SUR) Ps a UB 
O FFFFFFFF 1 FFFFFFFE 

Cc MSB LSB C MSB LSB 

0 8s8O00O0FFFF ACC 0 S8O0O00FFFF ACC 
—=2.001 00 0 0 (SUB mem,16) _.-F FEF OO 0 0 (SUB mem,16) 
O T7TFFFFFFF 0 8O0O01FFFF 


Example 7-13 implements the subtraction of two 64-bit numbers on the 
TMS320C5x. A borrow is generated within the accumulator for each of the 
16-bit parts of the subtraction operation. 


Example 7-13. 64-Bit Subtraction 


eer e eee eer eeeeresreoere eee ere ese eeeseeeseesseesese reer erereeereereeeeeeoe 


c 
7 Two 64-bit numbers are subtracted, producing a 64-bit 
; result. The number Y (Y3,Y2,Y1,Y0) is subtracted from 
+ X (X3,X2,X1,X0) resulting in W (W3,W2,W1,W0). 

; If the result is required in 64-bit ACC/ACCB pair, 

3 replace the instructions as indicated in the comments 
i below. 

ce 

c 

‘ 


X3 X2 X1 X0 
; - Y3 Y2 Y1 YO 
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Figure 7-3. 16-Bit Integer Multiplication 


Signed Integer 


x Unsigned Integer 


Signed Multiplication 


Add X if Y 45 =1 


Final 32-Bit Result 


Steps Required: 
1) Multiply two operands X and Y as if they are signed integers, 


2) If MSB of the unsigned integer Y is 1, add X to the upper half of the 32-bit 
signed product. 


_ The correction factor must be added to the signed multiplication result because 
the bit weight of the MSB of any 16-bit unsigned integer is 215. 


Consider following representation of a signed integer X and an unsigned inte- 
ger Y: 


X = 219x465 + 214x444 213x493 +... + 2Ixy + 20XH 


Y= 21 yy5 4 214yy4 + 218yi9 4... 4 2lyy + Qo 


Multiplication of X and Y would yield: 
XxY = X x (215 yy5 4 214y44 4 218yi9 +... + Qlyy + Wyo) 


= 215y4eX + 214y44X + 213y43X Peco 21y4X + 20yoX (1) 


However, if X and Y are considered signed integers, their multiplication would 
yield: 


Xx¥ = Xx (-2yy5 + Q14y44 4 213 y49 +... + Qlyy + Qyo) 
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Extended-Precision Arithmetic 


The following example implements this algorithm. The product is a 64-bit inte- 
ger number. Note in particular, the use of BSAR and XC instructions. 


Example 7-14. 32-Bit Integer Multiplication 


title “32-bit Optimized Integer Multiplication” 
def MPY32 


eeoeoereeseese2 eoeoereer eee eee eer ee eee reer een eee eeseeoe ee 
i a ie a ae a a 2 2 2 2 2 22 2 2 2 2002020020000 0 0020222 


routine multiplies two 32-bit signed integers result- 
ing in a 64-bit product. The operands are fetched from 
data memory and the result is written back to data memory. 
Data Storage: 

X1,X0 32-bit operand 

~Y¥21,Y0 32-bit operand 

W3,W2,W1, WO 64-bit product 
Entry Conditions: 

DP = 6, SXM = 1° 


me Ve Me Ve Se Ve Ve Ve Ve Be We 


OVM = 0 
a i a a a OO 0 O22 

X1 -set 300h 7;DP=6 

xo eset 30l1h 7;DP=6 

Y1 -set 302h ;DP=6 

Yo -set 303h ;DP=6 

W3 .set 304h 7;DP=6 

W2 eset 305h ;DP=6 

W1 eset 306h 7;DP=6 

wo eset 307h 7;DP=6 

text 

MPY32: 
BIT X0,0 ;TC = XO bit#15 
LT XO 7T = XO 


MPYU Yo _#P = X0YO 
SPL wo ;Save WO 


SPH W1 7;Save partial Wl 

MPY Y1 7P = X0Y1 

LTP X1 7Acc = X0Y1, T = Xi 

MPY Yo 3P = X1Y0 

MPYA Yl zAcc = XOY1+X1Y0, P=X1Y1 
ADDS Wl zAcc = X0Y1+X1Y0+X0Y02*-16 
SACL W1 7Save final Wl 

BSAR 16 7Shift Acc right by 16 

XC 1,TC ;If MSB of XO is 1 


ADD Yl zAdd Yl 

BIT YO,0 ;TC = YO bit#15 

APAC sACC = X1Y1 + (XOY1+X1Y0)2*-16 
XC 1,TC ;IF MSB of YO is 1 

ADD Xl ;Add X1 

SACL W2 Save W2 

SACH W3 7Save W3 
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Integer and fractional division can be implemented with the SUBC instruction 
as shown in Example 7-16 and Example 7-17, respectively. When imple- 
menting a divide algorithm, it is important to know if the quotient can be repre- 
sented as a fraction and the degree of accuracy to which the quotient is to be 
computed. For integer division, the absolute value of the numerator must be 
greater than the absolute value of the denominator. For fractional division, the 
absolute value of the numerator must be less than the absolute value of the 


denominator. 


Long Division: 


0000000000000101 


SUBC Method: 


32 HIGH ACC __ 


| 

0000000000000000 
-10 
-10 


ID se 
0000000000000000 


-10 
-10 


| 
0000000000000100 


-10 
0000000000000001 
| 
0000000000000011 
~-10 
0000000000000000 


0000000000000001 
-10 
0000000000000011 


| Remainder 


000000000000110 
)000000000010001 
—101 
110 
—101 
11 


LOW ACC ___0 


0000000000100001 
1000000000000000 
041141141111011111 


0000000001000001 
1000000000000000 
01141111110111110 


0010000000000000 
1000000000000000 
1010000000000000 


0100000000000001 
1000000000000000 
1100000000000001 


1000000000000011 
1000000000000000 
— 11141411111111101 


0000000000000110 


| | Quotient 


Quotient 


Remainder 


(1) 


(2) 


(14) 


(15) 


(16) 


Comment 


Dividend is loaded into ACC. The di- 
visor is left-shifted 15 and subtracted 
from ACC. The subtraction is nega- 
tive, so discard the result and shift 
left the ACC one bit. 


2nd subtract produces negative an- 
swer, so discard result and shift ACC 
(dividend) left. 

e 

e 


14th SUBC command. The result is 
positive. Shift result left and replace 
LSB with 1. 


Result is again positive. Shift result 
left and replace LSB with 1. 


Last subtract. Negative answer, so 
discard result and shift ACC left. 


Answer reached after 16 SUBC in- 
structions. 
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Example 7-17. Fractional Division Using SUBC 
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RRR ETRE RETR TERETE TR ERTR RETR RTE RRR TEETER RETR TREE ERRTTEETTER ERE 
* This routine implements fractional division with the SUBC instruction. For 
* this division routine, the absolute value of the denominator must be 
* greater than the absolute value of the numerator. In addition, the 
* calling routine must check to verify that the divisor does not equal 0. 
* ‘ 
* The 16-bit dividend is placed in the high accumulator, and the low accumulator 
* is zeroed. The divisor is in data memory. 
bolt ee ee oe ee ee 2 
DENOM -set 60h 
NUMERA set 61h 
QUOT set 62h 
REM -set 63h 
TEMSGN .set 64h 
* 
FRACDIV LDP #0 . 
LT NUMERA + Determine sign of quotient. 
* 
MPY DENOM 
SPH TEMSGN 
LACL DENOM . 
ABS ; Make denominator and numerator positive. 
SACL DENOM 
LACC NUMERA,16 ; Load high accumulator, zero low accumulator. 
ABS 
* 
* If divisor and dividend are aligned, division can start here. 
* 


RPT #15 
SUBC DENOM 


l6-cycle division. Low accumulator contains 
the quotient and high accumulator contains the 
remainder at the end of the loop. 


me Se Ne 


BIT TEMSGN, 0 ; Test sign of quotient. 

RETCD NTC ; Return if sign positive, else continue. 
SACL QUOT ; Store quotient and remainder during delayed 
SACH REM ; return. 

LACL #0 ; If sign negative, negate quotient 

RETD 7; and return 

SUB QUOT 


SACL QUOT 
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7-32, 


-Point Arithmetic 


SAMM save the accumulator contents in a memory-mapped 


register 

LACB accumulator is loaded with contents of accumulator 
buffer 

SACB contents of accumulator are copied in accumulator 
buffer 


SATL accumulator is barrel-shifted right by the value 
specified in the 4 LSBs of TREG1 ~ 

SATH accumulator is barrel-shifted right by 16 bits 
if bit 4 of TREG1 is a one. 

SPLK store immediate long constant in data memory 

CPL compare long immediate value (or DBMR) with data 
memory 
TC=1 if two values are same 
TC=0 otherwise 


eevceee ee 


esoeveee eseeeee eooeecerecee eee eeereererereeerereeeereeeeeeeneeeeeereeeeseve 
A 2 2 2 2 2 A 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 


TREG1 -set Odh 


ASIGN -set 60h sSign, exponent, high and low part of mantissa 
AEXP -set 61h ;0f input number A 
AHT -set 62h 
ALO -set 63h 
BSIGN  .set 64h 7Sign, exponent, high and low part of mantissa 
BEXP -set 65h 7;0f input number B 
BHI -set 66h 
BLO -set 67h 
CSIGN .set 68h 7;Sign, exponent, high and low part of mantissa 
CEXP -set 69h 70of the resulting floating point number C 
CHI -set 6Ah : 
CLO -set 6Bh 
DIFFEXP .set 6Ch 
etext 
FL_ADD LDP #0 7Initialization 
SETC SXM 7Set sign extension mode 
MAR *, ARO ;ARP <- ARO 
LAR ARO, #0 #ARO is used by NORM instruction 
CMPEXP LACL BLO Load low Acc with BLO 
ADD BHI,16 7Add BHI to high Acc 
SACB 7AccB = BHIBLO 
LACC AEXP 
SUB BEXP 7Acc = AEXP=BEXP 
SACL DIFFEXP 7;Save the difference 
BCND AEQB, EQ s;If |Al == (IB 
BCND ALTB, LT 7If |All < (BI 
AGTB LACC DIFFEXP z;If {Al > IBI 
SAMM TREG1 ;Load TREG1 with # of right shifts reqd. 
SUB #32 
BCND AGRT32,GEQ ;If difference > 32 
LACB zAcc = BHIBLO 
SATL 
SATH 7Right justify BHIBLO 
SACB 7Store the result back in AccB 
AEQB LACC ASIGN #Copy sign and exponent values of 
SACL CSIGN 7A in C (i.e. the result) 


LACC AEXP 
SACL CEXP 


CHKSGN LACC ASIGN 7 Acc=ASIGN-BSIGN 
SUB BSIGN 
CLRC TC 7Clear TC flag 
XC 1,LT 7If A<O and B>0 
SETC TC 7;Set TC flag 


BCNDD ADNOW,EQ 7If both A and B have same sign 
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ADD AHI,16 3 Acc=AHIALO 
SATL 
SATH ;Right-justify ALOAHI 
BD CHKSGN 7Jump back after next two instructions 
SACL ALO ;Save normalized value 
SACH AHI sin ALO and AHI 

BGRT32 LACC BHI. ;If exponent of B > 32 
SACL CHI sthen C <- B. 
RETD ;Return after 
LACC BLO ;saving CHI and CLO 
SACL CLO 

AGRT32 LACC AHT s;If exponent of A > 32 
SACL CHI sthen C <- A. 
LACC ALO 
SACL CLO ;Copy ALO to CLO 
LACC ASIGN 
SACL CSIGN 7Copy ASIGN to CSIGN 
RETD 7;Return after 
LACC AEXP ;copying AEXP to CEXP 
SACL CEXP 


Example 7-19. Floating-Point Multiplication Using BSAR 
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+e + HH FH FF HH HF HH HH HHH SEH HKEHHHE HHH HEH HSE 


-title ‘Floating Point Multiplication Routine’ 


eoerereee eee ean eee eereeeseeese eo eree eee eee eee eee eee ere ereees 
[i 2 2 2 A 2 A 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 


SUBROUTINE MULTIPLIES TWO FLOATING-POINT NUMBERS PRODUCING 
A NORMALIZED FLOATING-POINT PRODUCT. THE FORMAT OF FLOATING- 
POINT NUMBERS IS SPECIFIED BELOW. 


we 
=e 
=e 


Hs. 
s.. 
He. 
nN. 


INPUT / OUTPUT FORMAT 


ee Gs sete om 
(TICS peo 
(0) THESE] mtn mae ov sss 
| 16BITS | LOW PART OF MANTISSA 


NOTE THAT EVEN IF THE PRODUCT IS ZERO, SIGN OF THE PRODUCT MAY 
EITHER BE POSITIVE OR NEGATIVE DEPENDING ON THE INPUTS. 


Key C5x Instructions: 

BSAR 1-16 bit right barrel arithmetic shift in one cycle 

CLRC reset control bit 

SETC set control bit 

BD branch after executing next two one-word instructions 
or one two-word instruction 


ASIGN -set 60h 
AEXP -set 61h 


7Sign, exponent, high and low parts of mantissa 
70f input number A 


AHI -set 62h 

ALO -set 63h 

BSIGN .set 64h ;Sign, exponent, high and low parts of mantissa 
BEXP -set 65h 7;0of input number B 

BHI -set 66h 

BLO -set 67h 
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7.10 Application-Oriented Operations 


7.10.1 Modem Application 


Digital signal processors are especially appropriate for modem applications. 
The TMS320C5x devices with their enhanced instruction set and reduced in- 
struction cycle time are particularly effective in implementing encoding and de- 
coding algorithms. Features like circular addressing, repeat block, and 
single-cycle barrel shift reduce the execution time of such routines. 


Example 7—20 implements a differential and convolutional encoder for a 9600- 
bit/s V.32 modem. This encoder uses trellis coding with 32 carrier states. The 
data stream to be transmitted is divided into groups of four consecutive data 
bits. The first two bits in time Q1, and Q2, in each group are differentially en- 
coded into Y1,, and Y2, according to the following equations: 


Y1n = Q1p ® Y1n-4 

Y2n = (Q1 n e Y1 n-1) ® Y2n-1 ® Q2, 
This is done by a subroutine called DIFF. The two differentially encoded bits 
Yin and Y2n are used as inputs to a convolutional encoder subroutine EN- 


CODE, which generates a redundant bit YOn. These five bits are packed into 
a single word by the PACK subroutine. 


Example 7-20. V.32 Encoder Using Accumulator Buffer 


etitle ‘Convolutional Encoding for a V.32 Modem’ 
-mmregs 
STATMEM -set 60h 7 (60h - 62h) Delay States S$1,S2,S3 
INPUT -set 64h 7 (64h - 67h) Four input. bits 
YPAST -set 68h 7 (68h - 69h) Past values of Y1 and Y2 
OUTPUT -set 63h 7Y0O, the redundant bit 
LOCATE -set 6ah ;Temporary storage for current input word 
PCKD_IP -set 1000h sInput buffer (4 bits packed per word) 
PCKD_OP .set 2000h ;Output buffer (5 bits packed per word) 
COUNT -set 50 7# of input data words 
etext 
INIT LAR AR1, #PCKD_IP 
LAR AR2, #PCKD_OP 
LAR AR3, #COUNT-1 ;COUNT contains # of input words 
LDP #0 
START MAR *, ARI 
LACC *+,0,AR0 
SACL LOCATE ;Temporary storage for current input word 
LAR ARO, #INPUT+3 , 
LACL #3 ;Loop 4 times 
SAMM BRCR 
LACL #1 
SAMM DBMR ;Load DBMR with the mask for LSB 
UNPACK LACC LOCATE 7Acc = packed input bits 
RPTB LOOP1-1 ;for I=0, I<=3, I++ 
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7.10.2 Adaptive Filtering 
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There are many practical applications of adaptive FIR/IIR filtering; one exam- 
ple is in the adapting or updating of coefficients. This can become computation- 
ally expensive and time-consuming. The MPYA, ZALR, and RPTB instructions 
on TMS320C5x can reduce execution time. 


A means of adapting the coefficients on the TMS320C5x is the least-mean- 
square algorithm given by the following equation: 


b, (i+ 1) = by (i) + 2Be(i)x(i-k) 


where e (i) = x (i) ~— y (i) 
and cst 


y(i) = >> by x(i-k) 


k=0 


! 


Quantization errors in the updated coefficients can be minimized if the result 
is obtained by rounding rather than truncating. For each coefficient in the filter 
at a given point in time, the factor 2*B*e(i) is a constant. This factor can then 
be computed once and stored in the T register for each of the updates. 


MPYA and ZALR instructions help in reducing the number of instructions in the 
main adaptation loop. Furthermore, the RPTB (repeat block) instruction allows 
the block of instructions to be repeated without any penalty for looping. 


Example 7-21 shows a routine that implements a 128-tap FIR filter and an 
LMS adaptation of its coefficients. The single-access internal RAM of 
TMS320C50/C51 can be mapped in both the program and data spaces at the 
same time by setting OVLY and RAM control flags to 1. This feature can be 
used to advantage by locating the coefficients table in single-access internal 
RAM so that it can be accessed by MACD and MPY instructions without modi- 
fying RAM configuration. Note that the MACD instruction requires one of its op- 
erands to be in program space. 


If the address of the coefficient table is to be determined in runtime, load the 
BMAR (block move address register) with the address computed dynamically 
and replace the instruction 


MACD  COEFFP,*- 
by 
MADD *— 
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7.10.3 IIR Filters 


Infinite impulse response (IIR) filters are widely used in digital signal process- 
ing applications. The transfer function of an IIR filter is given by: 


H(z) _ Do + b,z" +... + byz™ . Y(z) 
~ fear. tayz — X(z) 
An Nth order direct-form II IIR filter can be represented by the following block 
diagram: 
Figure 7-5. Nth Order Direct-Form Type II IIR Filter 


x(n) y(n) 


In the time domain, an Nth order IIR filter is represented by the following two 
difference equations: 


at time interval n: 
x(n) is the current input sample 
y(n) is the output of the IIR filter 
d(n) = x(n) —d(n—1)ay —... —d(n—-N+1)an_y 
y(n) = d(n)bo + d(n—1)b; +... + d(n—-N+1)byy_4 


The above two equations can easily be implemented on the TMS320C5x by 
using multiply-accumulate instructions (MAC, MACD, MADS, MADD). Note 
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Example 7-23. Using LTD and MPYA 


.title ”“N Cascaded BiQuad IIR Filters” 


-IMmregs 


eoeereceeer eee eee eae eee eeeeseeeseeeeeeeereeevoee 


; This routine implements N cascaded blocks of biquad IIR 


; canonic type II filters. 


Each biquad requires 3 data 


; Memory locations d(n),d(n-1),d(n-2), and 5 coefficients 


; ~al,-a2,b0,b1,b2. 


; For each block: d(n) = 


a 

c 

; Coefficients Storage: 
7 ~a2,-al,b2,b1,b0, 
i ist biquad Nth biquad 
; State Variables: 
; d(n),d(n-1),d(n-2), ... 
; Nth biquad 


; Entry Conditions: 
AR1 -> d(n-2) of lst biquad 
AR2 -> -a2 of lst biquad 
AR3 -> input sample (Q15 number) 
AR4 -> output sample or number) 


; DP = 0, PM = 0, ARP = 3 


eooeoereeec ese eco ec eee seer eeeese ee eeeseereeeeeeeeee 


x(n) -d(n-1) al-d(n-2) a2 
y(n) = d(n)b0+d(n-1)bl+d(n-2)b2 


(low to high data memory) 
,7a2,-al,b2,b1,b0 


(low to high data memory) 
,d(n),d(n-1),d(n-2) 
lst biquad 


corer er ee see eee er eeeense 


(A A 2 2 2 2 2 2 2 2 2 2 2 A A 2 2 2 


BIQUAD: ;Setup variables; 
ZPR 7 Clear P register 
LACC *,15,AR1 ; Get Q15 input 
SPLK #2, INDX ? 
SPLK #N-1,BRCR ; 


;Begin computation; 


Setup count 


Setup index register 


RPTB ELOOP-1 3 repeat for N biquads 
LOOP: 
LT *-, AR2 3; T = d(n-2) 
MPYA *+,AR1 3; Acc = x(n), P = -d(n-2) a2 
LTA *-,AR2. 3; Acc t= -d(n-2)a2, = d(n-1) 
MPY *+ 3; P = -d(n-1)al 
LTA *+,AR1 3 Acc += -d(n-1l)al, T = b2 
SACH *0+,1 3; Save d(n) 
MPY *- 3; P = d(n-2)b2 
LACL #0 3; Acc = 0 
LTD *-,AR2 3; T = d(n-1), d(n-2) = d(n-1) 
MPY *+,AR1 3 Acc += d(n-2)b2, P = d(n-1)bl 
LTD *-,AR2 3; T = a(n), a(n-1) = d(n) 
MPY *+,AR1 3; Acc += d(n-1)bl, P = d(n)b0o 
ELOOP: 
LTA *,AR4 ; Final accumulation 
SACH *,1 ; Save output in Q15 format 


7.10.4 Dynamic Programming 


Dynamic programming techniques are widely used in optimal search algo- 
rithms. Applications such as speech recognition, telecommunications, and ro- 
botics use dynamic programming algorithms. The TMS320C5x digital signal 
processors have an enhanced instruction set for efficient implementation of dy- 


namic programming methods. 


Most real-time search algorithms use the basic dynamic programming princi- 
ple that the final optima! path from the start state to the goal state always pass 
through an optimal path from the start state to an intermediate state. This helps 
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Figure 7-6. Backtracking With Path History 


State #0 


n-4 n-3 n-2 n—-1 n 


Path Trace for 5 Periods Current Time Period 


Buffer + 15 


Current Time Period 
Path History Circular Buffer (N = 4) 


Example 7-24. Backtracking Algorithm Using Circular Addressing 
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ecoeecereecseer cere eee eee eee eoeseeerseeseeoseeeeerseeeeeeeeesee2eeees 


A a A 2 A 2 2 2 A 2 2 2 2 i 2 2 2 2 2 2 2 2 2 2 a 2 A 2 2 
Backtracking Example 

This program back-tracks the optimal path expanded by 
a dynamic programming algorithm. The path history 
consists of four paths expanded N times. It is set up 
as a circular buffer of length N*4. 

Note that decrement type circular buffer is used. 

The start and end address of the circular buffer are 
initialized this way because of two reasons: 

l- to avoid skipping the end-address of circ buffer 
2- to ensure that wrap-around is complete before next 
iteration. 


eeoeoeter erro ese eee eee esr eee eee ee eee eeerseer eee eee eseeeereeeer eres 


i i A a 2 2 2 i 2 i 2 2 2 2 2 2 2 


LAR ARO, #BUFFER ; get buffer address 
LMMR INDX,PATH  ; get the selected path [0..3] 
SPLK #N~1,BRCR 7 trace back N time periods 
* init. ARO as pointer to circular buffer#1; length=N*4 words 
SPLK #BUFFER+ (N-1) *4,CBSR1 
SPLK #BUFFER-3, CBER1 
SPLK #08h,CBCR 


Re Ne Ne Ne Me Me Ye Ne Me Ne Me & 


. 


RPTB TLOOP~-1 for i=0,i<N,i++ 


MAR *O+ ; offset by state# 

LACC *Q- * get next pointer & reset to state#0 

SAMM INDX 3 Save next state# 

SBRK 3 ; decrement ARO to avoid skipping CBER1 

,SBRK 1 7 now ARO is correctly positioned 1 time 
TLOOP : # period back (circular addressing) 
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Figure 7-8. An In-Place DIT FFT With In-Order Inputs but Bit-Reversed Outputs 


x(0) 
x(4) 
x(2) 
x(6) 
x(1) 
x(5) 
x(3) 
x(7) 


Legend for twiddle factor: Wo = we W, = W 


Wo 
Wo 
Wo 


Wo 


Stage 1 Stage 2 Stage 3 


x(0) 


KD ° 
c/ Y | as 
‘< OT 
ry ry x(6 
VT™ PRE DT” 


2 
g Wa = Wg Ws = W 


Table 7-1. Bit-Reversal Algorithm for an 8-Point Radix-2 DIT FFT 
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| Index | Bit Pattern Bit-Reversed Pattern Bit-Reversed Index 


NOOR GNM =“ O 


The bit-reversed addressing mode is part of the indirect addressing implem- 
ented with the auxiliary registers and the associated arithmetic unit. In this 
mode, a value (index) contained in INDX is either added to or subtracted from 
the auxiliary register being pointed to by the ARP. However, the carry bit is not 
propogated in the forward direction; instead, it is propagated in the reverse di- 
rection. The result is a scrambling in the address access. 


The procedure for generating the bit-reversed address sequence is to load 
INDX with a value corresponding to one-half the length of the FFT and to load 
another auxiliary register—for example, AR1—with the base address of the 
data array. However, implementations of FFTs involve complex arithmetic; as 
a result, two data memory locations (one real and one imaginary) are asso- 
ciated with each data sample. For ease of addressing, the samples are stored 
in workspace memory in pairs with the real part in the even address locations 
and the imaginary part in the odd address locations. This means that the offset 
from the base address for any given sample is twice the sample index. If the 
incoming data is in the following form: 
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Example 7-25. Macros for 16-Point DIT FFT 


KKK KKK IKKE KKK KKK KKK KKK KK KKK KK KIKI KKK KKK IKKE KKK KKK KKK KKK KKK IKK KEKE KKK KKK KKKKKKK 
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* FILE: cScxrad2.mac 
* 


* COPYRIGHT TEXAS INSTRUMENTS INC. 1990 
FOI KIRK IIR IK II RK TOK II TOK IK IIR IIR IK KK I IK IK IKI RII KIKI IKARIA KAKI AK KARR KK RK KK 


* 


* MACRO ’COMBO2X’ FOR THE COMPLEX, RADIX~2 DIT FET 
* 


* ORGANIZATION OF THE INPUT DATA MEMORY: 
* 


--> macro file for radix 2 fft’s based on 320c5x 


R1,11,R2,12,R3,13,R4,14 


* 
* 
* 


FORK KIKRIKRHEK IH KRKEKK KKK IKKE KKEKKKKKEKKKKKKKEKKKKKKEMKAKKKKKKaKKKKKKKEKKKKKKKKHAKK KAKI 


THE 


Rl 
R2 
R3 
R4 
Tl 
I2 
I3 
r4 


Qs #£ e+e HHH HHH SA 


=e 


we =e 


we 


™e 


™e 


1eeod aad 


MACRO 


’ COMBO2x’ 


[ (R1+R2) +(R3+R4) 1/4 
[ (R1-R2) +(I3-14)]/4 
[ (R1+R2) ~(R3+R4)J/4 
[ (R1-R2) - (13-14) ]/4 
[ (11+1I2)+(I3+14)]/4 
[ (I1-12) - (R3-R4) J] /4 
{ (11+I2) —(13+24)]/4 
[ (I1-I2) + (R3-R4)]/4 


OMBOSx $MACRO 


times fora 


num 
SPLK #:num:-1,BRCR 
RPTB comboend ; 
LACC *,14,AR4 ; ACC 
SUB *,14,AR5 ; ACC 
SACH *+,1,AR4 ; T1 
ADD *+,15,AR5 ; ACC 
SACH *,1,AR2 Per? 
ADD *,14,AR1l ; ACC 
ADD *,14 3; ACC 
SACH *+,0,AR5 ; R1 
SUB *,16,AR3  ; ACC 
SACH *+,0,AR5 ; R3 
ADD *,15,AR2 ; ACC 
SUB *,15,AR3  ; ACC 
ADD *,14,AR4 ; ACC 
SUB *,14,AR2 ; ACC 
SACH *+,0,AR4 ; R2 
ADD *-,15,AR3 ; ACC 
SUB *,15,AR4 ; ACC 
SACH *+,0,AR1 ; R4 
LACC *,14,AR2 ; ACC 
SUB *,14,AR5 ; ACC 
SACH *,1,AR2 ; T2 
ADD *,15,AR3  ; ACC 
ADD *,14,AR4 ; ACC 
ADD *,14,AR1 ; ACC 
SACH *0+,0,AR3 ; Il 
SUB *,15,AR4 ; ACC 
SUB *,15,AR3  ; ACC 
SACH *0+,0,AR5 ; I3 
LACC k—,15 7; ACC 


tow wo 


oe 00 08 ce @0 eo ce of 


ae oe ce 06 #6 00 80 0 of of 


we 


Vadirn non a dt 


PERFORMS FOLLOWING CALCULATIONS: 


ARO = 7 

AR1 -> R1,I1 

AR2 -> R2,12 

ARP-> AR3 -> R3,I3 
AR4 -> R4,14 


OUTPUT 
AR1 - > R5,1I5 
AR2 - > R6,16 
ARP - > AR3 ~- > R7,17 
AR4 - > R8,I18 
to be 


16 point FFT). 


- For a 16-point Radix 2 complex FFT the Macro ’COMBO2:’ has 
repeated N/4 times (e.g. 4 


REPEAT MACRO ’COMBO5x’: N/4 times 


execute ‘num’ 


(R3) /4 
(R3-R4) /4 
(R3-R4) /2 


(R3+R4) /4 
(R3+R4) /2 


(R2+R3+R4) /4 
(R1+R2+R3+R4) /4 
(R1+R2+R3+R4) /4 
(R1+R2-(R3+R4)) /4 
(R1+R2- (R3+R4))/4 


(R1+R2) /4 

(R1-R2) /4 

((R1-R2) +(I3))/4 

( (R1-R2) + (I3-14))/4 
((R1-R2) +(I3-14))/4 
((R1-R2)+ I34+1I4 )/4 
((R1-R2) - (13-14) )/4 
((R1-R2) -(I3-14))/4 


(I1)/4 

(I1-I2) /4 

(I1-I2) /2 
((I1+1I2))/4 
((I1+I2)+(1I3))/4 
((I1+1I2)+(13+14))/4 
((I1+1I2)+(I3+14))/4 
((I1+I2)-(I3+I4))/4 
((I1+I2)-(I3+14))/4 
((I1+I2)-(I3+1I4))/4 


(11-12) /4 


times ’COMBO5:’ 


RRR III IK KKK KK RI RII IO RRR IO RIOR ROI ROR RR RIO III A RR RR RR RRR KR RK IK 


* 


+H HHH EE HEH 


ARP AR1 AR2 AR3 AR4 ARS 


4 
5 
4 


Noo 


HP hWAND BW DY nw oe 


(S3) OWbBWeE & BHM Oh 


R2° R3 
R2 R3 
R2 I3 
R2. R3 
R2 R3 
R2 R3 
R2 R3 
R2 R3 
R2 R3 
R2 13 
R2 13 
R2 13 
R2 I3 
R2 13 
I2 13 
I2 13 
I2 13 
I2 13 
I2) 13 
I2 13 
I2 13 
I2 13 
I2 13 
I2 13 
I2 13 
I2 13 
I2 13 
I2 R7 
I2  R7 


R4 
R4 
R4 


I4 
I4 


I4 
I4 
r4 
I4 
I4 


r4 
I4 
r4 
I4 
I4 
R4 
R4 
I4 


I4 
I4 
I4 
I4 
I4 
r4 
r4 
I4 
I4 
I4 


I4 


Tl 
Tl 
T2 


T2 
T2 


T2 
T2 
T2 
T2 
T2 


T2 
T2 
T2 
T2 
T2 
T2 
T2 
T2 


T2 
T2 
T2 
T2 
T2 
T2 
T2 
T2 
T2 
T2 


Tl 
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* QR’= PR - (W*QI + W*QR) = PR - W* QI - W* QR. (<- AR2) * 
* PI’= PI + (W*QI - W*QR) = PI + W* QI - W* OR (<- AR1+1) * 
* QI’= PI — (W*QI - W*OR) = PI - W* QI + W* QR (<- AR1+2) * 
* * 
KEKE KKEKKKKEKKEKEKK KKK EK KKK KKK KKK KR KKK KEKE KKKEKKKKKKKKKKKKKKKKRKKKKRKKRKKK 
PBY4I SMACRO 3; TREG= W AR5 PREG AR1 AR2 ARP 
MPY *+,AR5 3; PREG= W*QR/2 - W*QR/2 PR QI 5 
SPH *, AR1 3 TMP = W*QR/2 W*QR/2 W*QR/2 PR QI 1 
LACC *,15,AR2 ; ACC = PR/2 W*QR/2 W*QR/2 PR QI 2 
MPYS ke 3; ACC = (PR-W*QR) /2 W*QR/2 W*QI/2 PR QR 2 
SPAC 7 ACC = (PR-W*QI-W*QR)/2 W*QR/2 W*QI/2 PR QR 2 
SACH *+,0,AR1 ; QR = (PR-W*QI-W*QR)/2 W*QR/2 W*QI/2 PR QI 1 
SUB *,16 ; ACC = (-PR-W*QI-W*QR) /2 W*QR/2 W*QI/2 PR QI 1 
NEG 3; ACC = (PR+W*QI+W*QR) /2 W*QR/2 W*QI/2 PR QI 1 
SACH w+ 3; QR = (PR+W*QI+W*QR) /2 W*QR/2 W*QI/2 PI QI 1 
LACC *,15,AR5 ; ACC = (PI) /2 W*QR/2 W*QI/2 PI QI 5 
SPAC 3; ACC = (PI-W*QI) /2 W*QR/2 - PI QI 5 
ADD *,16,AR2 ; ACC = (PI-W*QI+W*QR) /2 - - PI QI 2 
SACH *+,0,AR1L ; QI = (PI-W*QI+W*QR) /2 - - PI QRi 1 
SUB *,16 3 ACCU= (-PI-W*QI+W*QR) /2 = - PI QR1 1 
NEG 3 ACCU= (PI+W*QI-W*QR) /2 - - PI QR1 1 
SACH *+,0,AR2 ; PI = (PI+W*QI-W*QR) /2 = = PR1 QR1 2 
SENDM 
* 
KRKKKKKKKEKKKKE KHER KEKE KKK KKK KKK KKK KKK KK RRR KKK KKK 
* : * 
* MACRO ’P3BY4J’ number of words: 16 * 
* * 
* ENTRANCE IN THE MACRO: ARP=AR2 * 
* AR1->PR,PI * 
_* AR2->0R,QI * 
* TREG=W=COS (45) =SIN (45) * 
* * 
* PR’= PR + (W*QI - W*QR) = PR + W * QI - W * OR (<- AR1) * 
* QR’= PR - (W*QI - W*QR) = PR - W* QI + W* OR (<- AR2) * 
* PI’= PI — (W*QI + W*QR) = PI - W* QI - W* OR (<- AR1+1) * 
* QI’= PI + (W*QI + W*QR) = PI + W * QI + W* QR (<- AR1+2) * 
* * 
* EXIT OF THE MACRO: ARP=AR2 * 
* AR1->PRt+1,PI+1 * 
* AR2->QR+1, QI+1 * 
* * 
KKEKEKKKEKEKK KEKE KEK KKK KEK KKK RR KEKE KK RRR KEKE KKK KERR KEK RRR KKK RE KKK ERK KKKKKKKERKEE 
P3BY4I SMACRO ; TREG= W ARS PREG AR1 AR2 ARP 
MPY *+, AR5 3; PREG= W*QR/2 - W*QR/2 PR QI 5 
SPH *, ARL 3 TMP = W*QR/2 W*QR/2 W*QR/2 PR QI 1 
LACC *,15,AR2 3; ACC = PR/2 W*QR/2 W*QR/2 PR QI 2 
MPYA *- 3; ACC = (PR+W*QR)/2 W*QR/2 W*QI/2 PR QR 2 
SPAC 3 ACC = (PR-W*QI+W*QR)/2 W*QR/2 W*QI/2 PR QR 2 
SACH *+,0,AR1 3 OR’ = (PR-W*QI+W*QR)/2 W*QR/2 W*QI/2 PR QI 1 
SUB *,16 3 ACC = (-PR-W*QI+W*QR) /2 W*QR/2 W*QI/2 PR QI 1 
NEG 3; ACC = (PR+tW*QI-W*QR)/2 W*QR/2 W*QI/2 PR QI 1 
SACH e+ ; PR’ = (PR+W*QI-W*QR)/2 W*QR/2 W*QI/2 PI QI 1 
LACC *,15,AR5 3; ACC = (PI)/2 W*QR/2 W*QI/2 PI QI 5 
| APAC 3; ACC = (PI+W*QI) /2 W*QR/2 - PI QI 5 
ADD *,16,AR2 3 ACC = (PI+W*QI+W*QR) /2 - = PI QI 2 
SACH *0+,0,AR1 ; QI’ = (PI+W*QI+W*QR) /2 - - PI QR5 1 
SUB *,16 3; ACCU= (-PI+W*QI+W*QR) /2 = = PI QR5 1 
NEG 3 ACCU= (PI-W*QI-W*QR)/2 . - = PI QR5 1 
SACH *0+,0,AR2 ; PI’ = (PI-W*QI-W*QR) /2 - - PR5 QR5 2 
SENDM 
SEKELAERERAARRERERAK EAR RRHD A REDE RERESE EAD EAEKERER EA AAREAEEEE SEDER TRAE RR EEESSE 
* * 
> MACRO ’stage3’ number of words: 54 * 
* * 
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Example 7-26. Initialization Routine 


-bss 
-bss 
-bss 
-bss 
-bss 
-bss 


Temp variables 


<e te 


-bss 


=e 


-sect 
B 


-sect 
TABINIT: word 

-word 

-word 
TABEND: eset 


c 

INIT: LDP 
SPM 
CLRC 
SETC 
SPLK 


INIT Block B2 


Re te Ne 


LAR 
LACC 
RPT 
TBLR 


Re te Ne 


LAR 
LACC 
RPT 
TBLR 


EXECUTE THE FFT 


we te Ne 


LAR 
CALL 
WAIT RET 
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NN,1 
NN2Z,1 
DATAADD, 1 
cos45,1 
sin4,1 
cos4,1 


TEMP ,2 


“vectors” 
INIT, *, ARO 


wr in it “ 
N,N-1,2*N-1,DATA 
5A82h 

TWID, TWID+4 

$ 


#0 7 use only 
0 

OVM 

SXM 
#pmstmask, PMST 


ARO, #NN 
#TABINIT 
#TABEND-TABINIT 
+ 


INIT TWIDDLE FACTORS 


ARO, #TWID 
#TWIDSTRT 
#TWIDLEN 
x4 


ARS, #TEMP 
FFT, *,AR3 


# number of fft-points 
3 2*N-1 
; START ADDRESS OF DATA 


3; start of sine in stage 4 
7 Start of cosine in stage 4 


we 


used for temporary numbers 


cos (45) =sin (45) 


= 


B2 and mmregs for direct addressing 
no shift from PREG. to ALU 
disable overflowmode 

enable sign extension mode 
ndx=trm=1 


Me Ne Ne 


=e 


arp is already pointing to ar0 


=e 


arp is already pointing to ar0 


Ne 


pointer to 2 temp register 
ARP=AR3 FOR MACRO COMBO 


we Ne 


; Return 
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Hr] we we Ne 


e 
, 
° 
, 
e 
, 
s 


TAGE1L: 


FFT 


END: 
FFTLEN 


-sect 


FFT CODE for STAGES 1 and 


SPLK 
LAR 
LAR 
LAR 
LAR 
COMBOSX 4 


SPLK 
LAR 
LAR 
stage3 


CODE FOR STAGE 4 


SPLK 
LAR 
LAR 
LAR 
LAR 
SPLK 
ZEROIL 
BUTTFLYI 
RET 
-set 
.set 
.end 


“fftprogram” 


AR3, DATAADD ; 


me Ne 


#INPUT, *BRO+ 


#7, INDX 


AR1, DATAADD 
AR2, #DATA+2 
AR3, #DATA+4 
AR4, #DATA+6 


#9, INDX 


AR1, DATAADD 
AR2, #DATA+8 


2 


#1, INDX 


AR1,DATAADD 
AR2, #DATA+16 


AR3,cos4 
AR4,sin4 
#6,BRCR 


$ 
END-FFT+1 


to 


me Se Me Ne Me Ne 


CODE FOR STAGE 3 / ARP=AR2 


me Me Me Ne 


/ ARP=ARP 


me Me Me Ne Ne 


“=e Me 


LEE ISLE EEL EL ELL SERRE AEE LLL EERE ENE LEE LEER SEED TEE 


FFT CODE WITH BIT-REVERSED INPUT SAMPLES / ARP=AR3 


TRANSFER 32 WORDS FROM ’input’ to ‘’data’ 


indesmregister = N 
N TIMES 


indexregister = 7 


pointer to DATA r1,il 
pointer to DATA + 2 r2,12 
pointer to DATA + 4 r3,13 
pointer to DATA + 6 r4,i4 


repeat 4 times 


index register = 9 
arl -> DATA 

ar2 -> DATA+8 
repeat 2 times 
index register = 1 


start of cosine in stage 4 
start of sine in stage 4 


execute ZEROI 
execute 7 times BUTTFLYI 


Software Applications 


A Electrical Specifications 


Pinout 


A.1 Pinout and Signal Descriptions 


Figure A-1. TMS320C50/C51 Pinout 


132-Pin 
Quad Flat Pack Packaget 
(Top View) 


= 
129 128 127.126 12 


~~ On 


51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 


L__ LJ {__J 
OQERVERLQELLELAARG ZOSESELL ELS Aaeey 
22 A ee ee ee Be2Q2gegee22 8eee 

o> >> 7D & > 2 
rs) : 


Tt See Pin Assignments, Table A-1 (page A-3) for location and description of all pins. The TMS320C50 


and TMS320C51 will be packaged in 132-pin plastic QFP in production. See Figure A-18 for mechanical 
data. 


Note: NC = No connect. (These pins are reserved.) 
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Table A-1._ TMS320C50/C51 Pin Assignments (Continued) 


a 
SS 
[36 [vss | Supply | Ground —SSSCSC—~—~—SCS 
a 
ee dort SSCSCSCS~S 
ec 


‘Nonm 


Parallel Port Address Bus 
(10 pins) 


Tt NC = No connect 
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Pinout and Signal | Descriptions 


Table A-1. TMS320C50/C51 Pins (Concluded) 


[Pin [Name Te Ceserptlon 
[tor —| ves ——=«dY=SSuply =| «Gourd SSC~—~—SCS 
a Se Oe 
P tos [ ouxwpa [1 —~*;s« lock Mode Ping 
Sa ee aa 
—108_} TESKIFAM | 1OZ __| Saal Eom 2 irenstnities Frame Sync 


108 — Ror 
[109 
[10 | ouKouT 


"Te 
“TFSAVTADD 
Trunk [1a Por Ranaber oak 
[Ts [1 evicorest SS SSCSC~* 
(i a 
‘(0a iatvornSSCS~S 
[rst op soy 
a 


T NC =No connect 
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Electrical Characteristics and Operating Conditions == iwNWw... 


Table A-4. Electrical Characteristics Over Specified Free-Air Temperature Range (Unless Otherwise Noted) 


Iz Three-state current 
I Input current 
(Vi=Vgs to Vpp) 


Ippc Supply current, Core CPU 
lpbpp Supply current, pins 


lpp Supply current, 
power down modes 


C; Input capacitance 
Co Output capacitance 


X2CLKIN pin 
All other input only pins 

Operating Ta=0°C, Vpp=5.25 V, fy=40:96 MHz 
Operating Ta=0°C, Vpp=5.25 V, fy=40.96 MHz 


Allinput and output voltage levels a 
reference levels. 


=A OM +--+ 


Output 
Under 
Test 


Where: Io. = 2.0 mA (all outputs) 
IOH = 300 pA (all outputs) 
VLOAD = 2.15 V 
CT = 80 pF typical load circuit capacitance. 
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Clock Characteristics and Timing 


A.3 Clock Characteristics and Timing 


The TMS320C50/C51 can use either its internal oscillator or an external fre- 
quency source for a clock. The clock mode is determined by the CLKMD1 (pin 
71) and CLKMD2 (pin 103) clock mode pins. The following table outlines the 
selection of the clock mode by these pins. 


CLKMD1 | CLKMD2 Clock Source 
pee ae. | 0 External divide-by-one glock a 


A.3.1 


ies resistance of 30 ohms anda power 
ecified at a load rear of 20 a 


Test Conditions 
Uo to 70°C 40.96 


al state of the processor when fy = 0 Hz, the input clock can be stopped only when both CLKIN and 
1.1 IDLE2 mode, clocks are guaranteed to be stopped properly internal to the device. Therefore, in IDLE2 
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A.3.3 External Divide-by-One Clock Option 


An external frequency source can be used by injecting the frequency directly 
into CLKIN2, with X1 left unconnected and X2 connected to Vpp. This external 
frequency is divided by one to generate the internal machine cycle. The divide- 
by-one option is used when the CLKMD1 pin is strapped high and CLKMD2 
is strapped low. 


The external frequency injected must conform to specifications listed i 
ing requirements table. 


Table A-9. Timing Requirements Over Rec 


Parameter 


haracterization data and not tested. 
Ate of the processor when f, = 0 Hz, the input clock can be stopped only when both CLKIN2 and 


| 
| | w(CIL) {>| | 
ou SANSA LAT ALAIN IAIN PRLS 


[ | 
ta(CIH-CO) I | | | 
eee —— | | kK twcoL! | | tr(CO) : 


| | 
cuxours Yn natal TN \_/NYNSNSNINSN 
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Clock Characteristics and Timing 


Figure A-7. Memory and Parallel I/O Interface Read and Write Timing — 


| | Ji aes a a 


> ten(D)A 
RAW oe ta(A) aoe \ : | | / 
ta(R) -#—$>} : . 


tsu(D)R 4+ 


DATA 


| - ten(D)WR —> 
paar tw(SH)R 


I~ tw(SL)R | 


tw(SL)W 


| | 
'—— twSH)w ——> 


a s always require two cycles to prevent external bus conflicts. 
ad: and a two-cycle write and is not drawn to scale. All external writes 
id or immediately followed by an external read require three machine cycles. 
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Clock Characteristics and Timing - rte. 


Figure A-9. Ready Timing for Externally Generated Wait States During an External Write Cycle 


CLKOUT! | | | | | | | 


ADDRESS 


tsu(R-CO) —4-—>| 


Wait State Generated by READY. 
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A.3.8 Instruction Acquisition (IAQ), Interrupt Acknowledge (IACK), and External 


Flag (XF) Timings 


Table A-15, Switching Characteristics Over Recommended Operating Conditions (H = 0.5¢/co)) 


a 2 

tsu(A)IAG __Seluptime, address validbeforetGiowt (| H-8 | rs 
" A)IAQ Hold time, aiciess vale after [AQ high 

*o IAQH TAQ high pulse duration 

tsu(A)IACK Selle time, address vale before IA IACK low 


Mine = 
| 
| 


-—______—_ Vic 
a 


EE cee 


[<< —___—_—_—_——- tw(IACKH) ——--—--_-__—__> 


| 
ee td(XF) 
XF \ 
Note: 1AQ and IACK are not affected by wait states. 
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Figure A-12. External DMA Timing 


HOLD \ 4 
. | 
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Clock Characteristics and Timing 


A.3.11 Serial Port Transmit Timing With External Clocks and Frames 


Table A-19. Switching Characteristics Over Recommended Operating Conditions (S = 0.5¢/scK)) 


tsu(DX Setup time, DX valid before CLKX falling 
thiDX Hold time, DX valid after CLKX falling 


ani 
e(Scx) _ Seralpartciockaysetime ——SCS~—~—CSCSCSCSSSCS*~“*~*‘“dRCS CCS 
[ns | 
[iqS0K) Seal portclockrisetime SSCS Ys 
[ns _| 
Fed 
Tas | 
[ns] 


i att ar 
FtwiSCk) ___Serialport clock low/high pulse duration __————=«d(L_@vM SCS 
fines) FSX hold time after CLKXtalingedge —=S~C~C~idSiaSCSC~C~CS~S~S~SCS 
th(FS)H FSX hold time after CLKX rising edge 


Tt Ifthe FSX pulse does not meet this specification, the first bit of serial data will be driven on the DX pin until the falling edge 
of FSX. After the falling edge of FSX, data will be shifted out on the DX pin. The transmit buffer empty interrupt will be generated 
when the th(RS) and th(RS)H Specification is met. 


Figure A-14. Serial Port Transmit Timing With External Clocks and Frames 


ke—— tovSCK) —* 
is "e(SCK) + et 
| *— > twscx) coe 
t | | le- 
ae) aa 3 th(FS)H ; | ere rech) 
h(FS) —-—-+«——> | 
t 
_ XX (*—>+ twSck) ——— 
tsu(Dx)-4——>1 
! ace th(DX) 
1 2 7/15 8/16 
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A.3.13 Serial Port Receive Timing in TDM Mode 


Table A-22. Timing Requirements Over Recommended Operating Conditions (H = 0.5¢/¢0)) 


Pe ae 


Tt These parameters apply only to the first bits in the 
+ FSX timing and waveforms shown in Figure A-16 are| 
nal case is illustrated in the transmit tim 
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Clock Characteristics and Timing 


A.3.15 Timer Output 


Table A-25. Switching Characteristics Over Recommended Operating Conditions (H = 0.5¢/co)) 


a 
jaroUT) TOUT pase drain i 


Figure A-18, Timer Output 
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External Interface Timing 


Figure B~1. Memory Interface Operation for Read-Read-Write (0 Wait State) 
| | | | | . | 
cKourr NX ON TNS NST NS NS VS - 
| | | | | | 


ADDRESS ) } 

| | | | | l 
DATA | Write Data ! 
| | | 

| | 
RW | | l | | 
| | | | 

| | | | 

By l 

RD | | | 


| | ; | 

1-Cycle Read ——_lq—______»} | | 
| lq 3-Cycle Write ————_—_p| 

1-Cycle Read ————_ P| 
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Figure B-3. Memory Interface Operation for Read-Write (1 Wait State) 


| | | | | | 
| | | | | 
ADDRESS 


| | 
DATA Read Write Data | 
| | | | | 


| 
| Two-cycle Read | 
le with one Ready- | 


generated > 


wait state | 


k¢———-  Three-cycle write ———>»l 
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C TMS320C5x System Migration 


C.1 Package and Pin Layout 


C-2 


The TMS320C25 is available in both a 68-pin CPGA and a 68-pin PLCC as 
shown in Figure C—1 and Figure C—2, respectively. The TMS320C50 and 
TMS320C51 are packaged in a 132-pin Quad Flat Pack package (QFP). This 
package is shown in Appendix A. 


TMS320C5x System Migration 


Package and Pin Layout 


Figure C-2. TMS320C25 68-Pin Plastic Leaded Chip Carrier 


1,35 (0.053) 
1,19 (0.047) 


x 45° 


2,79 (0.110) 
2,41 (0.095) 


4,50 (0.177) 
4,24 (0.167) 


23,62 (0.930) 


23,11 (0.910) 
(At Seating Plane) 


1,27 (0.050) T.P. 
(See Note B) 


\_ 0,94 (0.037) 


0,69 (0.027) 
Seating ain 


Thermal Resistance Characteristics 


Parameter 


Junction-to-free-air 5 
ReJA thermal resistance | 48 [ow 
1 [row 


R Junction-to-case 
6JC thermal resistance 


0,25 (0.010) R Max 
in 3 places 


24,33 (0.956) 


24,13 (0.950) 
| (see Note A) 


ee a ee OS cy ne ee ee ae re ee ee Be Oe 


25,27 (0.995) 
25,02 (0.985) 


24,33 (0.956) 


24,13 (0.950) 
(see Note A) 


| 25,27 (0.995) 
25,02 (0.985) 


1,22 (0.048) 
1,07 (0.042) 


1,52 (0.060) Min 


0,81 (0.032) 
0,66 (0.026) > _ 
+ 0,64 
| (0.025) 
Min 
| . 
|. 0,51 (0.020) 


0,36 (0.014) 


Lead Detail 


Notes: A. Centerline of center pin, each side, is within 0,10 (0.004) of package centerline as determined by this dimension. 
B. Location of each pin is within 0,127 (0.005) of true position with respect to center pin on each side. 


ALL LINEAR DIMENSIONS ARE IN MILLIMETERS AND PARENTHETICALLY IN INCHES. 


C-4 TMS320C5x System Migration 


Blt dla AA Alc he LO , , Se cae ae nae 


Only two TMS320C25 signals (CLKOUT2 and SYNC) are not present on the 
TMS320C5x. Because the TMS320C5x operates with a divide-by-two clock, 
it can be synchronized with reset. Therefore, there is no need for the SYNC 
signal. With only two phases, there are no external timings that tie to the 
CLKOUT2 of the TMS320C25. 


Some of the TMS320C25-equivalent pins have additional capabilities on the 
TMS320C5x. The TMS320C5x supports external direct memory access of the 
on-chip single-access RAM block. For this reason, the following signals are 
now bidirectional: 


AQ-A15 = address lines 
STRB = memory access strobe 
R/W = read/write 
BR = bus request 


The TMS320C5x serial port transmit clock (CLKX) can now be configured as 
an output that operates at one-fourth the machine clock rate. CLKX is confi- 
gured as an input by reset. The TMS320C25 CLKX pin is always an input. 


The TMS320C25 operates with a four-phase clock. This device’s machine rate 
is one-fourth the CLKIN rate. CLKOUT1 and CLKOUT2 operate at the machine 
rate and are 90° out of phase. The TMS320C5x operates with a two-phase 
clock. The device’s machine rate is one-half the CLKIN rate. In addition, the 
TMS320C5x offers a divide-by-one clock input feature so that the device’s ma- 
chine rate equals the CLKIN rate. CLKOUT1 operates at the machine rate. 
Figure C-4 shows both the TMS320C25 and the TMS320C5x clocking 
schemes. 


Figure C-4. TMS320C25 and TMS320C5x Clocking Schemes 
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C.2 Timing 


The TMS320C25 and the TMS320C5x operate with some timing differences. 
These timing differences include aspects of the on-chip operation as well as 
aspects of the external memory interfacing. One key difference is that the 
TMS320C5x is capable of operating at two to three times the speed of a 
TMS320C25. Another key difference is that the TMS320C25 operates with a 
three-deep pipeline, while the TMS320C5x operates with a four-deep pipeline. 
Key differences in the external memory interface encompass the faster 
TMS320C5x and include certain external interface enhancements. The final 
key difference is that some compatible operations execute in a different num- 
ber of machine cycles. This section describes these differences. 


C.2.1 Device Clock Speed 


C.2.2 Pipeline 


The TMS320C25 operates its machine cycles with a divide-by-four clocking 
scheme. The TMS320C5x uses a divide-by-two clocking scheme. This means 
that a TMS320C25, operating with a 40-MHz CLKIN, executes its machine 
cycles within 100 ns, while the TMS320C5x, which is operating with the same 
CLKIN, executes its machine cycles in 50 ns. This clocking arrangement 
changes the way that the signals of the devices are specified. Many of the 
TMS320C25 timing values, given in the TMS320 Second-Generation Digital 
Signal Processor Data Sheet, are specified as quarter-phase (Q)+Nons. The 
timing values of the TMS320C5x are defined in half-phases (H). 


The TMS320C25 operates with a three-deep pipeline, while the TMS320C5x 
onerates with a four-deen nineline. This means that anytime thereis aprogram 
counter (PC) discontinuity (for example, branch, call, return, interrupt, etc.), 
it takes four cycles to complete with the TMS320C5x, whereas it takes three 
cycles on the TMS320C25. The TMS320C5x, however, also has delayed in- 


structions that take only two cycles to complete. 


C.2.3 External Memory Interfacing 


C-8 


The TMS320C5x is designed to execute external memory operations with the 
same signals as the TMS320C25. As mentioned above, the TMS320C5x oper- 
ates at twice the instruction rate of the TMS320C25 when both operate with 
the same input clock. The TMS320C5x uses its software wait-state generators 
to compensate for this interface difference. The TMS320C5x device, operating 
with one software wait state, has similar memory timing to the TMS320C25 op- 
erating with no wait states. However, external writes require two cycles on the 
TMS320C5xdevices. The exact timing of the signals differ because of the more 
advanced process used with the TMS320C5x. 


The TMS320C5x has two additional memory interface signals to reduce the 
amount of external interfacing circuitries. The RD signal can be used to inter- 
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C.3 Instruction Set 


The TMS320C5x instruction set is a superset of the TMS320C25 instruction 
set. The instruction set of the TMS320C25 is upward source-code compatible. 
This means that all of the instruction features of the TMS320C25, implemented 
and code written for the TMS320C25, can be reassembled to run on the 
TMS320C5x. . 


The serial port mode control bits have been moved from the status registers 
to the serial port control register. Because they are no longer part of the CPU 
registers, they no longer have direct instructions to set or clear them. The bits 
of the SPC can be manipulated easily with the PLU instructions. The following 
table shows the instructions used to replace the serial port instructions (note 
that the data page pointer must be set to zero to execute these new instruc- 
tions): 


TMS320C25 TMS320C5x 


#0FFFFh,SPC 
#8,SPC 
#0FFFDh,SPC 


#2,SPC 
#0FFFBh,SPC 
#4,SPC 


~ Note that any or all three bits can be set in one execution of the OPL instruction, 
while any or all three bits can be cleared using the APL. The bits can be toggled 
with the XPL instruction. The I/O ports of the device are addressable in data 
memory space on the TMS320C5x devices. This means any instruction that 
can address data memory can also address the I/O ports. 


There are a number of new instructions on the TMS320C5x devices. These 
instructions provide a more orthogonal addressing scheme and exercise the 
new CPU enhancements. In order to simplify the description of the instruction 
set, a number of different instructions are combined into single new instruc- 
tions with additional operand formats, as in this example: 


TMS320C25 TMS320C5x 


ADD “+ ADD *+ 


ADDK OFFh ADD #0FFh 
ADLK OFFFFh ADD #0FFFFh 
ADDH ‘+ ADD *+,16 


Refer to Chapter 4 for the detailed discussion of the instruction set. 


The IDLE instruction, when executed, stops the CPU from fetching and execut- 
ing instructions until an unmasked interrupt occurs. The TMS320C25 automat- 
ically enables the interrupts globally with the execution of the IDLE instruction; 
this saves the extra instruction word/cycle required to execute the EINT (en- 
able interrupts globally) instruction. Upon receipt of the interrupt, the 
TMS320C25 executes the interrupt vector-‘and resumes operations. The 
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C.4 On-Chip Peripheral Interfacing 


The TMS320C5x has more peripherals than the TMS320C25; many 
TMS320C5x peripherals are enhancements of the TMS320C25 peripherals. 
The TMS320C25 has three peripheral circuits: serial port, timer, and 16 I/O 
ports. In addition to these peripherals, the TMS320C5x has software wait 


‘states and a divide-by-one clock. 


The serial port of the TMS320C5x has been enhanced in that the CLKX pin can 
be configured as either an input or an output (CLKX is always an input on the 
TMS320C25). CLKX is configured as an input upon a device reset to maintain 
compatibility with the TMS320C25. The new serial port status bits are now 
mapped to a memory-mapped register that is used exclusively for the serial 
port. The serial port modes are no longer controlled via status register 1. There- 
fore, serial port modes that are changed by using LST1 instruction will no long- 
er work. The mode bits must be set/reset via the serial port control register 


. (SPC). The data transmit (DXR) and data receive (DRR) registers have been 


moved in the memory map from locations 1 and 0 to 33 and 32, respectively. 


The timer has been enhanced onthe TMS320C5x to include a divide-down fac- 
tor of 1 to 17 and can be stopped or reset via software. These additional fea- 
tures are controlled via the timer control register (TCR). Upon reset, the di- 
vide-down factor is set to 1, and the timer is enabled to maintain compatibility 
with the TMS320C25. The timer (TIM) and period (PRD) registers have been 
moved in the memory map from locations 2 and 3 to locations 36 and 37, re- 
spectively. | 


The 16 input/output ports of the TMS320C5x are addressable in the data 
memory space. This allows direct access of the I/O space by ihe core CPU and 
supports bit operation in the I/O space via the PLU. The I/O space is increased 
from 16 ports to 65,536 ports. However, no additional decode circuitry is neces- 


sary if only 16 ports are used. 


The TMS320C5x includes software wait-state generators that are mapped on 
16K-word page sizes in the program and data memory spaces. There are also 
wait-state generators for the I/O ports. The I/O space wait-state generators can 
be mapped on two-word or on 4K-word boundaries. These wait-state genera- 
tors allow the system to be programmed for 0, 1, 2, 3, 4, or 7 wait states, elimi- 
nating the need of an off-chip interfacing circuitry. External access wait states 
can be extended further via the READY signal. 
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) TMS320C5x Development Tools 


Figure D-1, TMS320C5x Development Environment 
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Software Development Tools 


makes it possible to monitor the state of the simulated device. The simulator 
accepts object codes that are produced by the macro assembler/linker system. 
Recent improvements have been made in simulation technology. The 
TMS320C5x software simulator uses a flexible high-level language debug 
monitor user interface. This interface allows the user to view both C language 
and assembly language to be viewed simultaneously. Single-stepping and 
software breakpoints may be executed in either language, providing a means 
for a high-level language debug environment. This interface is used on both 
the SWDS and XDS510, providing an easy transition to other tools. 


These are some key features of the TMS320C5x simulator: 
simulates the entire TMS320C5x instruction set 


simulates the key features of the on-chip TMS320C5x peripherals (serial 
ports and timer) 


has a high-level language debug monitor user interface 


has a windowed, mouse-driven interface, which can be user-customized 


Hod aga 


quickly stores/retrieves the simulation parameters from files to facilitate 
preparation for individual sessions 


i) 


performs reverse-assembly on source assembly code and C code, or al- 
lows both edit and reassembly of the source statements 


simultaneously displays memory in 

@ hexadecimal 16-bit values 

m@ assembled source code 

fiers many execution modes: 
single/multiple instruction count 
single/multiple cycle count 


cs) 
a 
@ until condition is met (UNTIL) 
= 
| 


0 


i 


while condition exists (WHILE) 
for set loop count ( FOR ) 
@ unrestricted run with halt by key input 


During program execution, the internal registers and memory of the simulated 
TMS320C5x are modified as each instruction is interpreted by the host com- 
puter. Execution is suspended when either a breakpoint or an error is encoun- 
tered or when the user halts execution. Once program execution is suspended, 
the internal registers and both program and data memory can be inspected 
and/or modified. Also, the trace memory can be displayed. A record of the sim- 
ulation session can be maintained in a journal file so that it can be re-executed 
to regain the same machine state during another simulation session. 


The simulator allows verification and monitoring of the states of the processor, 
without the requirement of hardware. The TMS320CS5x software simulator op- 
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™@ Watch Window for displaying values of selected variables, registers, 
or other C expressions. The window automatically displays output of 
the correct data type. | 

™@ Display Windows for displaying all field elements of a selected struc- 
ture or array. Display windows understand all data types and automati- 
cally display values as their correct types. If a member of a structure 
or array is another substructure or array, a display window can cause 
children or subwindows to show the substructure or array. 


These items are recommended for the interface: 
Gi ~acolor display for easily recognizing the different display elements. _ 


Cs ~agraphics display adapter (EGA or VGA board). Some boards produce 
a larger screen size, which the debugger takes advantage of. 


[i amouse to take full advantage of the window and menu feature set. 


_ TMS320C5x Development Tools 


Hardware Development Tools _ Snape nu — és ae acaba sae aaa aa ' ere ata'asalalaa'atatata'a alas ata's o <a "i nts sha’ ots 
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bility of the TMS320C5x within your target system. The XDS features real-time 
hardware breakpoint and program execution capabilities from target 
memory. The TMS320C5x JTAG serial-scan path is used to upload and down- 
load both program and data memory and to run all emulator functions. The PC- 
resident XDS has a cable for connection to your target system. The XDS has 
the same user interface as the software simulator and the SWDS. 


Key features of the XDS510 include 


C full- “speed execution and monitoring of the TMS320C5x in your target sys- 
tem via a 14-pin target connector 


loading/inspection/modification of all registers 

upload/download of both program memory and data memory 

high-level language debug monitor user interface | 
single-step execution 

software breakpoint/trace and timing, with up to thirty software breakpoints 
hardware breakpoint/trace on all program addresses 


emulator portability 


OoOoddodgdoeoda 


reconnectability for multiprocessing applications 


benchmark of execution time clock cycles in real time 


_ Full-speed emulation and monitoring of the target system is performed serially 


over a 14-wire cable, which runs from the XDS510 to the taraet system. Four- 
teen signals must be brought out of the target system and into a header si- 
tuated next to the TMS320C5x. The emulation cable is then connected to the 
header. The 14 signals are 4 JTAG ( IEEE standard P1149.1) scan path sig- 
nals, 3 emulation signals, 1 clock signal, and 6 power/ground signals. The | 
JTAG scan path controls the TMS320C5x in the targeted application and pro- 
vides access to all registers as well as to internal and external memory of the 
device. Since program execution takes place on the TMS320C5x in the target 
system, there are no timing differences during emulation. This new emulation 
technology offers significant advantages over the technology of traditional 
emulators. These advantages include 


C4 ‘no transmission problems related to cable length 
a nonintrusive system 
no loading problems on signals 


no artificial memory limitations 


Oooo oad 


acommon screen interface for ease of use 
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Header Signals 


E.1 Header and Header Signals 


To perform emulation with the XDS510, your target system must have a 14-pin 
header (two 7-pin rows) with connections as shown in Figure E-1. Table E—-1 
describes the emulation signals. 


Although you can use other headers, recommended parts include: 
Straight header, unshrouded DuPont Electronics® part number 67996—114 
Right-angle header, unshrouded DuPont Electronics® part number 68405—114 


Figure E-1. 14-Pin Header Signals and Header Dimensions 


TMS TRST 
TDI GND Header Dimensions: 
PD (+5 V) No pin (key) Pin-to-pin spacing: 0.100 in. (X,Y) 
TDO GND Pin width: 0.025 in. square post 
TCK_RET GND Pin length: 0.235 in., nominal 
TCK GND 
EMUO EMU1 


XDS510 Signal ; XDS510 State | Target State Description 
TMS a ae a JTAG test mode select. 
a 
TDO ! JTAG test daia outpui. 
JTAG test clock. TCK is a 10-MHz clock 
source from the emulation cable pod. This 
signal can be used to drive the system test 
clock. 
Tet | 0 | 1 _| sTAGtestreset 
EMUO | ___! | WO | _Emulationpino, 
} EMU1 Emulation pin 1. 
Presence detect. Indicates that the emula- 
tion cable is connected and that the target 
is powered up. PD should be tied to +5 
volts in the target system. 
JTAG test clock return. Test clock input to 
TCK_RET the XDS510 emulator. May be a buffered or 
unbuffered version of TCK. 
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E.3 Cable Pod 


E-4 
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Figure E-2 shows a portion of the XDS510 emulator cable pod. These are the 
functional features of the emulator pod: 


a 


Signals TDO and TCK_RET can be parallel-terminated inside the pod if 
required by the application. The default is that these signals are not termi- 
nated. 


Signal TCK is driven with a 74AS1034 device. Because of the high current 
drive (48 mA Io, /Ioy), this signal can be parallel-terminated. If TCK is tied 
to TCK_RET, then you can use the parallel terminator in the pod. 


Signals TMS and TDI can be generated from the falling edge of TCK_RET, 
according to the IEEE 1149.1 bus slave device timing rules. They can also 
be driven from the rising edge of TCK_RET, which allows a higher 
TCK_RET frequency. The defaultis to match the IEEE 1149.1 slave device 
timing rules. This is an emulator software option that canbe selected when 
the emulator is invoked. In general, single-processor applications can 
benefit from the higher clock frequency. However, in multiprocessing appli- 
cations, you may wish to use the IEEE 1149.1 bus slave timing mode to 
minimize emulation system timing constraints. 


Signals TMS and TDI are series-terminated to reduce signal reflections. 


A 10-MHz test clock source is provided. You may also provide your own 
test clock for greater flexibility. 
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Figure E-3. Emulator Pod Timings 


E-6 


TCK_RET 1.5V 


TMS TDI (Default) 


TMS TDI (Optional) 


[No | Reference | Description 


tT?CKmax 


tTCKhighmin | TCK_RET high pulse duration 


tTCKlowmin TCK_RET low pulse duration 


axTMXmin) TMS/TDI valid from TCK_RET low (default timing) 
d(XTMXmax 


dXTMSmin) TMS/TDI valid from TCK_RET high (optional timing) 
d)XTMSmax) . 


TDO setup time to TCK_RET high 
TDO hold time from TCK_RET high 


It is extremely important to provide high-quality signals between the emulator 
and the target processor. If the distance between the emulation header and the 
processor is greater than 6 inches, the emulation signals should be buffered. 
Sections E.4 and E.5 illustrate typical connections between the target proces- 
sor and the emulation header. 
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Multiprocessor Configura tion 


E.5 Multiprocessor Configuration 


Figure E-5. Multiprocessor Connections 


TMS320C4x TMS320C4x 
TMS320C5x TMS320C5x 


Figure E—-5 shows a typical multiprocessor configuration. This is a daisy- 
chained configuration (TDO-TD! daisy-chained), which meets the minimum re- 
quirements of the IEEE 1149.1 specification. The emulation signals in this ex- 
ample are buffered to isolate the processors from the emulator and provide ad- 
equate signal drive for the target system. One of the benefits of a JTAG test 
interface is that you can generally slow down the test clock to eliminate timing 


problems. Several key points to multiprocessor support are as follows: 


Gi ~The processor TMS, TDI, TDO, and TCK should be buffered through the 
same physical package to control timing skew better. 


Gi The input buffers for TMS, TDI, and TCK should have pullups to 5 volts. 
This will hold these signals at a known value when the emulator is not con- 
nected. A pullup of 4.7 kQ or greater is suggested. 


C4 Buffering EMU0 and EMU1 is optional, but highly recommended to provide 
isolation. These are not critical signals and do not need to be buffered 
through the same physical package as TMS, TCK, TDI, and TDO. Buffered 
and unbuffered signals are shown in Figure E-6 and Figure E-7. 


No signal buffering. In this situation, the distance between the header and 
the processor should be no more than 6 inches. 
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Gi It is extremely important to provide high quality signals, especially on the 
processor TCK and the emulator TCK_RET signal. In some cases, this 
may require you to provide special PWB trace routing and to use termina- 
tion resistors to match the trace impedance. The emulator pod does pro- 
vide optional internal parallel terminators on the TCK_RET and TDO. TMS 
and TDI provide fixed series termination. 
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Case 1: 


Case 2: 


Case 3: 


- tprdtck_TMS 


Single processor, direct connection, TMS/TDI timed from TCK_RET low (de- 
fault timing). 


[t(a(xTMSmax) + tsu(TTMS)] / ttekfactor 
(20ns + £10ns)/0.4 


75 ns (13.3 MHz) 


tordtck_TDO = [t(a(TTDO) + tsu(xTDOmin)] / ttckfactor 


=(15ns + 3ns)/0.4 
= 45ns (22.2 MHz) 


In this case, the TCK/TMS path is the limiting factor. 


Single processor, direct connection, TMS/TDI timed from TCK_RET high (op- 
tional timing). 
tordtck_TMS = td(xTMSmax) + tsu(TTMS) 
(24ns + 10ns) 

34 ns (29.4 MHz) 


tordtck_TDO (TOO) + tsu(xTDOmin)] / ttckfactor 
(1 + 3)/0.4 


45 ns (22.2 MHz) 


Inthis case, the TCK/TDO path is the limiting factor. One other thing to consider 
in this case is the TMS/TDI hold time. The minimum hold time for the XDS510 
cable pod is 7 ns, which meets the 5-ns hold time of the target device. 


Single/multiple processor, TMS/TDI buffered input; TCK_RET/TDO buffered 
output, TMS/TDI timed from TCK_RET high (optional timing). 

tprdtck_TMS = ta(XxTMSmax) + tsu(TTMS) + © ta(bufmax) 
24ns + 10ns + 2 (40) 

54 ns (18.5 MHz) | 


ta(TTDO) + tsu(XTDOmin) + tbufskew 
tickfactor 


(15ns + 3ns + = 1.35ns)/0.4 
58.4 ns (20.7 MHz) 


In this case, the TCK/TMS path is the limiting factor. The hold time on TMS/TDI 
is also reduced by the buffer skew (1.35 ns) but still meets the minimum device 
hold time. 


tordtck_TDO 
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F Memories, Analog Converters, Sockets, and Crystals 


F.1 Memories and Analog Converters 


This section provides product information for EPROM memories, codecs, ana- 
log interface circuits, and A/D and D/A converters. 


Allof these devices can be interfaced with TMS320C5x processors (see Chap- 
ter NO TAG for hardware interface designs). Refer to Digital Signal Processing 
Applications with the TMS320 Family for additional information on interfaces 
using memories and analog conversion devices. 


The following paragraphs give the name of each device and the location of the 
data sheet for that device in order to obtain further specification information if 
desired. 


Data sheets for EPROM memories are located in the MOS Memory Data Book 
(literature number SMYDO08). 


TMS27C64 

TMS27C128 
TMS27C256 
TMS27C512 


Another EPROM memory, TMS27C291/292, is described in a data sheet (liter- 
ature number SMLS291 4A). 


The TCM29C13/14/16/17 codecs and filters are described in the data sheet 
beginning on page 2—111 of the Telecommunications Circuits Data Book (liter- 
ature number SCT001). An analog interface for the DSP using a codec and fil- 
ter is provided by the TCM29C 18/19 data sheet (literature number SCT021). 


The data sheet for the TLC32040 analog interface circuit is provided in the /n- 
terface Circuits Data Book (literature number SLYDO02). 


In the same book are data sheets for A/D and D/A converters. The names of 
the devices are as follows: 


TLC0820 
TLO1205/1225 
TLO7524 
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F.3 Crystals 


This section lists the commonly used crystal frequencies, crystal specification 
requirements, and the names of suitable vendors. . 


Table F—1 lists the commonly used crystal frequencies and the devices with 
which they can be used. 


Table F-1. Commonly Used Crystal Frequencies 


[Device | Frequency 
TMS320C25 40.96 MHz 


TMS320C5x 20.48 MHz 
When connected across X1 and X2/CLKIN of the TMS320 processor, acrystal 


enables the internal oscillator. Crystal specification requirements are listed be- . 
low. 


Load capacitance = 20 pF — 
Series resistance = 30 ohm 
Power dissipation = 1 mW 


Vendors of crystals suitable for use with TMS320 devices are listed below. 


RXD, Inc. 
Norfolk, NB 
(800) 228-8108 


N.E.L. Frequency Controls, Inc. 
Burlington, WI 
(414) 763-3591 


CTS Knight, Inc. 
Contact the local distributor. 
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Figure G-1, 


TMS320 ROM Code Flowchart 


Customer TMS320 Design 


Customer Submits: 
— TMS320 New Code Release Form 

— Print Evaluation and Acceptance Form (PEAF) 
— Purchase Order for Mask Charge Prototypes 
— TMS320 Code 


Texas Instruments Responds: 
— Customer Code Input Into T! System 
— Code Sent Back to Customer for Verification 


Customer 
Approves 
Algorithm 


TI Produces Prototypes 


Customer 
Approves 
Prototypes (Minimum 
Production Order 
Required) 


TMS320 Production 
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G-4 ROM Codes 


H Device and Development Support Tool Nomenclature 


H-2 


Note: 


Predictions show that prototype devices (TMX or TMP) will have a greater 
failure rate than the standard production devices. Texas Instruments recom- 
mends that these devices not be used in any production system, because 
their expected end-use failure rate is still undefined. Only qualified produc- 
tion devices are to be used. 
en | 
Tl device nomenciature also includes a suffix with the device family name. This 
suffix indicates the package type (for example, N, FN, or GB) and temperature 
range (for example, L). Figure H—1 provides a legend for reading the complete 
device name for any TMS320 family member. 


Device and Development Support Too! Nomenclature 


Figure H-2 provides a legend for reading the part number for any TMS320 
hardware or software development tool. 


Figure H-2. TMS320 Development Tool Nomenclature 


TMDS 32 4 281 0-0 2 


QUALIFICATION see! 


TMDX = prototype 
TMDS = qualified 


DEVICE FAMILY 
32 = TMS320 family 


PRODUCT TYPE 


4 = software 
6 = hardware 
8 = upgrade 


MODEL# 


11 = XDS/11 
22 = XDS/22 
88 = upgrade kits 


OPERATING SYSTEMT 
02 = C1x VAX/VMS 
08 = C1x IBM MS/PC-DOS 
22 = C2x VAX/VMS 
28 = C2x IBM MS/PC-DOS 
32 = C3x VAX/VMS 
38 = C3x IBM MS/PC-DOS 
42 = C4x VAX/VMS 
48 = C4x IBM MS/PC-DOS . 
52 = C5x VAX/VMS 
58 = C5x IBM MS/PC-DOS 


Tt Software only. 
+ Hardware only. 


Lenin 


2 = 5-1/4” floppy disk 
8 = 1600 BPI magnetic tape 


S/W FORMATT 
0 = object code 
1 = source code 


SEQUENCE NUMBER? 


GENERATIONt 
1 =C1x 
2 = C2x 
3 = 03x 
4=C4x 
5 = C5x 


FORMATT 
1 = Tl-tagged 
5 = COFF 


Device and Development Support Tool Nomenclature 


circular buffer, 1-6, 4-11, 4-12, 4-13, 6-17, 6-26, 
6-27, 7-12—7-14 

CLKOUT1, 2-6 

CLKR, 2-8 

CLKR pins, 5-18 

CLKX, 2-8 

CLKX pins, 5-18 

clock option, 1-7 

computed GOTO, 3-30 

configuration, multiprocessor, E-8 

context save, 5-7 

context switching, 1-7 

convolution, 1-11, 3-2, 3-21 

CPU, 1-1 

CPU registers, 1-8, 3-34 

Crystals, F-4 

cycle timings (instructions), C-9, C-10, C-11 


data bus (D15~D0), 2-3, 2-4 

data memory, 1-4, 1-8, 1-9, 2-5, 3-2, 3-3, 3-4, 3-6, 
3-7, 3-8, 3-9, 3-10, 3-11, 3-13, 3-16, 3-17, 3-20, 
3-21, 3-22, 3-24, 3-25, 3-29, 3-34, 3-38, 3-41, 
3-43, 3-52, 3-53, 3-57 

data memory page pointer (DP), 3-11, 6-20, 6-21, 
7-2 

data pointer, 3-11 

data receive register (DRR), 5-3 

data transmit register (DXR), 5-3 

decode (pipeline), 3-17, 3-34, 3-35 

delayed branches, 7-18 

design, considerations, E-1 

development tool nomenclature, H-4 

development tools, TMS320CS5x, D-1 

device nomenciature, H-3 

direct addressing mode, 3-11, 3-12, 3-15, 4-2—4-4, 
4-11, 4-108, 4-110, 4-112, 4-114, 4-116, 4-178 

direct memory access (DMA), 6-31, 6-36—6-39 

divide-by-one clock, 1-7, A-12, C-7 

division, 5-23 

DMOV, 3-21, 3-29, 3-43 

DR, 2-2, 2-8 

DS, 2-2 

DX, 2-2, 2-8 - 
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dynamic programming, 7-42—7-54 


echo cancellation, 1-11 

electrical specifications, A-1 

emulation timing calculations, E-11 

emulator, XDS510, D-7 

emulator (XDS), D-8 

extended-precision arithmetic, 3-24, 3-25, 3-29 
external flag (XF) timing, A-18 

external memory interface, C-8 


Fast Fourier Transforms (FFT), 1-11 
filtering, 1-11, 3-2, 3-21 
filters 

FIR, 7-12 

liR, 7-40—7-42 
finite impulse response (FIR) filters, 7-39 
floating-point arithmetic, 7-31—7-35 
format bit (FO), 5-8, 5-16, 5-17, 5-20, 5-25 
four-level pipeline, 3-34, 3-35 
Fourier transforms, 1-11, 7-45—-7-54 
FSX, 2-2, 2-9 
functional block diagram, 3-3, 3-4 


global memory, 6-31—6-33 

global memory allocation register (GREG), 3-6, 3-7, 
3-10, 3-54, 6-18, 6-31, 6-32, 6-33 

graphics, 1-11, 6-37 


hardware development tools, D-7 

hardware multiplier, 3-27 

hardware stack, 1-6, 3-2, 3-9, 3-30, 3-59 
Harvard architecture, 1-4 

header, E-2 

header signals, E-2 

high level language (HLL) debugger, D-5 
HOLD, 3-40, 3-51, 3-54, 3-55, 3-57, 3-58, 3-60 
hold function, 3-58 . 


multiple processors, 6-32 

multiplication, 7-10—7-20 

multiplier, 1-6, 3-2, 3-3, 3-22, 3-24, 3-27, 3-28 
multiprocessing, 6-31, 6-36 


on-chip memory, 1-3, 1-4, 1-5, 1-8, 3-21, 3-39, 3-57 

on-chip program ROM, 1-6 

on-chip RAM, 3-21, 3-42 

on-chip RAM configuration control bit (CNF), 6-5, 
6-13 

on-chip ROM, G-1 

overflow flag (OV), 3-25 

overflow mode (OVM), 3-25 

overflow saturation mode, 3-25 


Parallel Logic Unit (PLU), 4-174, 4-210 


parallel logic unit (PLU), 1-6, 1-7, 5-16, 5-17, 6-16, 
6-18, 6-19, 7-1, 7-7, C-9, C-10 

period register (PRD), 5-8, 5-28, 5-29 

PFC, 3-7, 3-8 

pin assignments, 2-2 

pinouts, 2-2, A-2—A-6 

pipeline operation, 3-30, 3-34 


PMST, 3-37, 3-38, 3-47, 3-49, 4-6, 4-8, 4-11, 4-34, 
4-51, 4-71, 5-4, 5-7, 6-5, 6-7, 6-9, 6-13, 6-18, 


6-27, 7-2, 7-4 

power-down mode, 3-51 

prefetch counter, 3-8 . 

product register (PR), 3-2, 3-16, 3-22, 3-24, 3-27, 
5-7 

program address bus (PAB), 3-30 

program bus, 3-3, 3-29, 3-52 

program counter, 3-2, 3-11, 3-30, 3-54, 3-59 

program counter (PC), C-8 

program execution, 6-40, 7-2 

program memory, 1-4, 1-8, 1-9, 3-2, 3-11, 3-21, 
3-30, 3-55 

protocol, bus, E-3 
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RW, 2-2, 2-4 

RAM, 1-8, 6-14 
See also memory 

RAM overlay bit, 6-13 

RAM overlay bit (OVLY), 7-38 

READY, 2-2, 2-4 

registers 
auxiliary, 1-6, 3-5, 3-16—3-21, 6-16 
memory-mapped, 3-10, 5-2—5-4 
peripheral, 5-2, 5-3 

repeat counter (RPTC), 3-30, 3-34, 3-42, 3-55, 5-2 

reset (RS), 3-20, 3-25, 3-30, 3-42, 3-54, 3-60, C-6, 
C-12 

right shift, 3-25, 3-27, 3-28 

robotics, 1-11 

ROM code flow, G-2 

ROM code media, G-3 

ROM codes, 1-8, G-1 


scaling, 1-7 
scaling shifter, 3-2, 3-22, 3-24, 3-26 
serial port, 1-7, 1-9, 3-51, 3-54, C-6, C-7, C-9, C-10, 
C-12 
serial port timing, A-21, A-22, A-23, A-24, A-25 
shadow registers, 1-6, 4-144, 7-4 
shift modes, 3-27, 3-28 
shifters, 3-22, 3-24 
accumulator, 3-2, 3-16, 3-22, 3-24, 3-25, 3-26 
scaling shifter, 3-2, 3-22, 3-24, 3-26 
sign-extension mode bit (SXM), 3-2, 3-24, 3-27 
signal descriptions, 2-1—2-10, A-2—A-6 
single-instruction repeat (RPT) loops, 7-15—7-17 
sockets, F-3 
software development system (SWDS), C-13, D-7 
software development tools, D-3 
software stack, 7-6 . 
software stack operation, 7-6—7-7 
software wait states, 6-19, 6-34, C-8, C-12 
specifications, A-1 
square root example, 7-18 
stack, 1-6, 3-2, 3-30, 3-34, 3-38, 3-59, 5-7 
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(305) 429-8200; Hall-Mark (305) 971-9280; 
Marshall (305) 977-4880; Lex (305) 977-7511; 
Orlando: Arrow/Kierulff (407) 333-9300; 
Hall-Mark (407) 830-5855; Marshall (407) 
767-8585; Lex (407) 331-7555; Zeus (407) 
365-3000; 

Tampa: Hall-Mark (813) 541-7440; Marshall 
(813) 573-1399; Lex (813) 541-5100. 
GEORGIA: Arrow/Kierulff (404) 497-1300; 
Hall-Mark (404) 623-4400; Marshall (404) 
923-5750; Lex (404) 449-9170. 

ILLINOIS: Arrow/Kierulff (708) 250-0500; 
Hall-Mark (708) 860-3800; Marshall (708) 
490-0155; Newark (312)784-5100; Lex (708) 
330-2888. 

INDIANA: Arrow/Kierulff (317) 299-2071; 
Hall-Mark (317) 872-8875; Marshall (317) 
297-0483; Lex (317) 843-1050. 
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IOWA: Arrow/Kierulff (319) 395-7230; Lex (319) 
373-1417. 

KANSAS: Arrow/Kierulff (913) 541-9542; 
Hall-Mark (913) 888-4747; Marshall (913) 
492-3121; Lex (913) 492-2922. 

MARYLAND: Arrow/Kierulff (301) 995-6002; 
Hall-Mark (301) 988-9800; Marshall (301) 
622-1118; Lex (301) 596-7800; Zeus (301) 
997-1118. 

MASSACHUSETTS: Arrow/Kierulff (508) 
658-0900; Hall-Mark (508) 667-0902; Marshall 
(508) 658-0810; Lex (508) 694-9100; Wyle (617) 
272-7300; Zeus (617) 863-8800. 

MICHIGAN: Detroit: Arrow/Kierulff (313) 
462-2290; Hall-Mark (313) 462-1205; Marshall 
(313) 525-5850; Newark (313) 967-0600; Lex 
(313) 525-8100; 

Grand Rapids: ArrowMKierulff (616) 243-0912. 
MINNESOTA: Arrow/Kierulff (612) 830-1800; 
Hall-Mark (612) 941-2600; Marshall (612) 
559-2211; Lex (612) 941-5280. 

MISSOURI: Arrow/Kierulff (314) 567-6888; 
Hall-Mark (314) 291-5350; Marshall (314) 
291-4650; Lex (314) 739-0526. 

NEW HAMPSHIRE: Lex (800) 833-3557. 

NEW JERSEY: Arrow/Kierulff (201) 538-0900, 
(609) 596-8000; GRS (609) 964-8560; Hall-Mark 
(201) 515-3000, (609) 235-1900; Marshall (201) 
882-0320, (609) 234-9100; Lex (201) 227-7880, 
(609) 273-7900. 

NEW MEXICO: Alliance (505) 292-3360. 

NEW YORK: Long Island: Arrow/Kierulff (516) 
231-1000; Hall-Mark (516) 737-0600; Marshall 
(516) 273-2424; Lex (516) 231-2500; Zeus (914) 
937-7400; 7 
Rochester: ArrowKierulff (716) 427-0300; 
Haill-Mark (716) 425-3300; Marshall (716) 
235-7620; Lex (716) 383-8020; 

Syracuse: Marshall (607) 798-1611. 

NORTH CAROLINA: Arrow/Kierulff (919) 
876-3132; (919) 725-8711; Hall-Mark (919) 
872-0712; Marshall (919) 878-9882; Lex (919) 
876-0000. 

OHIO: Cleveland: ArrowMKierulff (216) 
248-3990; Hall-Mark (216) 349-4632; Marshall 
(216) 248-1788; Lex (216) 464-2970; 
Columbus: Hall-Mark (614) 888-3313; 

Dayton: Arrow/Kierulff (513) 435-5563; Marshall 
(513) 898-4480; Lex (513) 439-1800; Zeus (513) 
293-6162. 

OKLAHOMA: ArowKierulff (918) 252-7537; 
Hall-Mark (918) 254-6110; Lex (918) 622-8000. 
OREGON: Almac (503) 629-8090; ArrowMKierulff 
(503) 627-7667; Marshall (503) 644-5050; Wyle 
(503) 643-7900. 

PENNSYLVANIA: Arrow/Kierulff (215) 928-1800; 
GRS (215) 922-7037; Marshall (412) 788-0441; 
Lex (412) 963-6804. 

TEXAS: Austin: Arrow/Kierulff (512) 835-4180; 
Hall-Mark (512) 258-8848; Lex (512) 339-0088; 
Wyle (512) 345-8853; 

Dallas: Arrow/Kierulff (214) 380-6464; Hall-Mark 
(214) 553-4300; Marshall (214) 233-5200; Lex 
(214) 247-6300; Wyle (214) 235-9953; Zeus 
(214) 783-7010; 

Houston: Arrow/Kierulff (713) 530-4700; 
Hall-Mark (713) 781-6100; Marshall (713) 
895-9200; Lex (713) 784-3600; Wyle (713) 
879-9953. 

UTAH: Arrow/Kierulff (801) 973-6913; Marshall 
(801) 485-1551; Wyle (801) 974-9953, 
WASHINGTON: Almac (206) 643-9992, (509) 
924-9500; Arrow/Kierulff (206) 643-4800; 
Marshall (206) 486-5747; Wyle (206) 881-1150. 
WISCONSIN: Arrow/Kierulff (414) 792-0150; 
Hall-Mark (414) 797-7844; Marshall (414) 
797-8400; Lex (414) 784-9451. 

CANADA: Calgary: Future (403) 235-5325; 
Edmonton: Future (403) 438-2858; 

Montreal: Arrow Canada (514) 735-5511; Future 
(614) 694-7710; Marshall (514) 694-8142; 
Ottawa: Arrow Canada (613) 226-6903; Future 
(tay 820-8313; Quebec City: Arrow Canada 
418) 871-7500; 

Toronto: Arrow Canada (416) 670-7769; Future 
(416) 612-9200; Marshall (416)458-8046; 
Vancouver: Arrow Canada (604) 421-2333; 
Future (604) 294-1166. 
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